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THE EDITOR’S PAGE 


Integrating SSHRC’s New Programs 


As this is being written, members of parliament and cabinet ministers are being 
subjected to intensive lobbying in support of the Social Sciences and Humanities 
Research Council’s Five-Year Plan for Financing Research in the Social Sciences 
and Humanities, 1985-1990. Three features of the plan could have a substantial 
impact upon the Canadian educational research community. 

One is the inclusion of “education for the post-industrial society” among three 
new themes for targeted research deemed to be in the national interest. Research 
related to this theme is intended to help individual Canadians, and their educa- 
tional system, better meet the challenges of rapidly changing technology and mar- 
ket patterns. 

A second feature is the introduction of five-year “Canada Research Fellowships” 
to stimulate research activity immediately, and to provide a partial solution to the 
looming problem of staffing advanced education and research in the 1990s. 


The third is the creation of “research centres” to focus research on persisting 
national problems, promote interdisciplinary and multidisciplinary research, pro- 
vide research training, offer facilities for visiting researchers and the proposed Can- 
ada research fellows, act as nodes in communication networks, and link researchers 
and research users in government and the private sector. 

As this is being read, some of the decisions about the funding of the plan will 
have already been made by the federal government and the Council. The case in 
support of the three new features just enumerated is so compelling that it would be 
as surprising as it would be disastrous for them not to be pursued in some fashion. 

Equally compelling is the need to integrate research on education for the post- 
industrial society with the Canada research fellowships in a number of research 
centres. That integration maximizes impact has been demonstrated in connection 
with the study of Canada’s aging population during SSHRC’s first five-year plan. 
But integration does not just happen. It must be deliberately nurtured. 

Initially at least, nurturing the integration of SSHRC’s new programs in educa- 
tion is a leadership opportunity and obligation of deans of faculties of education. 
Theirs is a key role. They head the organization with the largest pool of relevant 
research talent. They are influential in shaping expectations for the use of that tal- 
ent. They are in a position to communicate effectively with the broader university 
community, provincial governments, teachers’ and trustees’ organizations, and local 
school districts. Thus, each of them is in a position to be the “powerful sponsor” 
and “continuing advocate” that has characterized every successful planning en- 
deavor of the past in Canadian education. 

If we had a “nurturer of the year” award in educational research in Canada 
for 1986, I’d bet the recipient would be a dean of education. 


WHW 
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BETTY COLLIS 
LLOYD O. OLLILA 


and 


LARRY D. YORE 


University of Victoria 


Predictive Validity of the Canadian Readiness Test 
and Selected Measures of Cognitive Development 
for Grade One Reading and Mathematics 
Achievement 


Correlational associations of the Canadian Readiness Test (CRT) and selected meas- 
ures of cognitive development with subsequent achievement in reading and mathe- 
matics were examined. Four subscales of the CRT were identified as being highly in- 
volved as predictors. Conservation of number demonstrated some potential as a pre- 
dictor of future achievement, but overall the addition of scores from Piagetian-type 
tasks to CRT data contributed little to the prediction of either reading or mathemat- 
ics achievement. The relationship between the constructs underlying the CRT and 
the cognitive development tasks was also examined. 


Prediction prior to Grade One of children’s readiness for reading and mathe- 
matics instruction and of their subsequent achievement in these areas is important 
for both practical and theoretical reasons. The practical concerns focus on the effi- 
cient and effective prediction of subsequent achievement using a concise, valid and 
reliable battery of tests. One of the purposes of this study is to further validate an 
existing readiness test in order to strengthen support for its use as an achievement 
predictor. Readiness testing is of interest from a theoretical perspective for at least 
two reasons: to aid in the identification of underlying skills and processes which are 
fundamental to learning how to read and how to compute mathematically, and also 
for the investigation of the relationship between cognitive development defined 


Dr. Collis is an Assistant Professor in the Department of Psychological Foundations, University of Victoria, with research inter- 
ests in measurement and evaluation of primary students, and computer applications in education. 

Dr. Ollila isa Professor in the Department of Communication and Social Foundations, University of Victoria, with research in- 
terests in primary language arts. 


Dr. Yore is an Associate Professor in the Department of Social and Natural Science, University of Victoria, with research inter- 
ests in cognitive development and science education. 
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Predictive Validity 


within a Piagetian framework and early school achievement. A second purpose of 
this study involves both these perspectives and focuses on the gain in theoretical 
knowledge and empirical prediction which may occur when readiness test results 
are supplemented with information derived from the administration of selected 
Piagetian tasks. 


The Canadian Readiness Test as a Predictor of Achievement. 


Studies by Gates, Bond and Russell (1939); Smith (1957); Barrett (1965); Bond 
and Dykstra (1967); and Johnson (1969) indicated predictive correlations between 
comprehension of oral language, usage of oral language, visual discrimination, audi- 
tory discrimination, knowledge of quantitative relationships, shape completion, 
sensorimotor skills related to handwriting, and following directions with later read- 
ing achievement. The Metropolitan Readiness Test (Hildreth, et al., 1965) and the 
Clymer-Barrett Prereading Battery (Clymer & Barrett, 1967) were designed around 
these results. However, Evanechko, Ollila, Downing, and Braun (1973) suggested a 
broader base was needed for effective readiness testing. They noted that: 

Less attention has been given to the non-perceptual components which constitute part of 
the total complex of reading readiness skills. Technical linguistic terms such as “word,” 
“letter,” “number,” “sound,” etc., are often used by teachers when talking to their first 
grade students. Similarly, teachers assume that children possess the ability to 
conceptualize and label real and vicarious experiences, that is, that they possess the abil- 
ity to associate meaning with verbal symbols. They also assume the child’s syntactical 
and morphological competence sufficient to enable him to form and comprehend 
grammatical structure in language. Further, the child’s cognitive functioning is infre- 
quently measured as an indicator of the ability to cope with the memory requirements of 
beginning reading. Similarly the rate at which the child can assimilate the visual linguis- 
tic stimuli is seldom considered by those predicting reading success. (p. 62) 


Based on this analysis, the Canadian Readiness Test (Evanechko, et. al., 1973), 
a battery of seven subscales, was developed to measure four general areas of 
prereading behavior—the child’s concept of the reading task, perceptual ability, 
linguistic competence, and level of cognitive functioning. In the original study, per- 
ceptual ability appeared to be the best predictor of reading achievement, followed 
closely by cognitive functioning ability, linguistic ability, and concept of the read- 
ing task. It was argued that subscales relating to each of these areas were important 
in strengthening both the diagnostic and predictive validity of reading readiness 
tests. In particular it was noted that word recognition behaviors were dependent 
upon the child’s readiness in letter recognition, learning rate, listening and seman- 
tics; comprehension of ideas depended upon readiness in letter recognition, seman- 
tics, and morphology; comprehension of instructions depended upon readiness in 
letter recognition, word matching, syntax, orientation of literacy, and ending 
sounds. Total achievement in reading was best predicted by the child’s readiness as 
measured by the subscales Letter Recognition, Semantics, and Beginning Sounds. 
The authors concluded that “to serve a diagnostic function, a readiness test should 
have a range of subtests including items which ensure the child’s concept of the 
reading task, his perceptual ability, linguistic competence, and cognitive 
functioning” (p. 78). 


Cognitive Development as a Predictor of Achievement 


Various researchers have explored cognitive development and related Piagetian 
tasks as predictors of reading and mathematics achievement (Waller, 1977). 
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Kingma and Koops (1983) found Piagetian tasks that measured seriation, conserva- 
tion and multiplicative classification to be equal or superior to traditional intelli- 
gence tests in predicting mathematics achievement. Taloumis (1979) found scores 
on conservation and measurement tasks to be highly predictive of mathematics 
achievement, and Dimitrovsky and Almy (1975) indicated early achievement of 
conservation ability to be predictive of later reading performance. Jordan and 
Brownlee (1981) reported results of a meta-analysis examining the relationship be- 
tween cognitive development and school achievement tests and indicated that 
Piagetian tasks account for approximately 29% of the variance in mathematics 
achievement and 16% of the variance in reading achievement within the studies in- 
volved in the meta-analysis. 

However, other researchers have not found Piagetian tasks to be effective readi- 
ness predictors. Brekke and Williams (1975) for example, found conservation not to 
be a significant predictor of reading achievement. Nelson (1980) presented a simi- 
lar conclusion about the predictive nature of cognitive functioning as measured by 
Piagetian tasks with respect to mathematics achievement. He summarized research 
in this area by noting: 

Ten years of research have failed to establish any practical connection between conserva- 
tion and mathematics learning. What is an essential and central concept in Piaget’s devel- 


opmental psychology seems to have little bearing on how children learn mathematics. 
(1980, p. xi) 


Arlin (1981) suggested that the lack of consistent evidence relating cognitive de- 
velopment tasks and achievement may be a result of the use of an insufficient or 
inappropriate battery of Piagetian tasks “with little rationale provided for the se- 
lection of one Piagetian task in preference to another” (p. 712). She found the three 
subsystems of Piagetian tasks—classification, seriation, and conservation—“to- 
gether appear to make significant contributions to a child’s achievement in reading 
and mathematics . . [and] that knowledge of a child’s general concrete operativity 
may be an important component of our understanding of the child’s general readi- 
ness for mathematics and reading instruction.” (p. 270) 

Against this background, the present study examines the following questions: 

1. Is the CRT an effective predictor of achievement at the end of Grade One? 


To what extent are the subscales of the CRT (and consequently its underlying 
theoretical model) involved in subsequent school achievement? 


3. Does the addition of selected Piagetian tasks to the CRT battery improve pre- 
diction of Grade One achievement? 


4. What is the relationship between the constructs underlying the CRT and the 
cognitive development constructs represented by the Piagetian task battery? Is 


this relationship pertinent to advancing what we know about how children 
learn? 


Method 
Sample 


A sample of 99 middle-class, mixed racial, English-speaking children enrolled in 
four Grade One classes located on lower Vancouver Island, British Columbia, Can- 


ada, was selected for this study. The subjects ranged in age from five years nine 
months to six years nine months at the beginning of the study. 
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Predictor Variables 
Cognitive Development 


Cognitive development involves a structured set of logico-mathematical abili- 
ties, processes and operations used to solve specific problem tasks. Special consid- 
eration being given to Arlin’s (1981) observation that many previous studies gave 
weak justifications for their selections of cognitive development variables, six 
Piagetian-type tasks were selected for this study on the basis that the cognitive 
skills embodied in the tasks appear related to reading and mathematics and that 
the acquisition of these skills was a reasonable expectation of Grade One children. 
The specific tasks selected were Conservation of Number, Conservation of Solid 
Amount, Conservation of Liquid Amount, Conservation of Length, Seriation of 
Length and Horizontal Reference Frames (Yore & Ollila, 1984). Conservation refers 
to an individual’s ability to recognize and rationalize the invariance of an identified 
quantity while other characteristics are changed. The logical thought embedded in 
conservation, specifically logical necessity, compensation, and reversibility, appear 
to reflect the logic needed to learn reading skills. Seriation, a logical grouping task, 
relates to the child’s ability to identify and apply an ordered length relationship to 
a set of rods. Seriation appears to parallel the temporal sequence necessary for sen- 
tence and paragraph comprehension. Horizontal Reference Frame, an infralogical 
grouping task, deals with a child’s conception of Euclidean space in which a 
consistent horizontal reference is viewed with changing, unrelated reference ob- 
jects. The horizontal/vertical reference frame involves similar orientational discrim- 
ination as involved in the discrimination of letters, specifically; b, p, q, and d. 

The tasks were administered using standard individual interview protocols and 
scored according to established criteria by a trained interviewer during September 
of Grade One. The scoring procedures utilized four ordered categories for the Con- 
servation and Seriation tasks and five ordered categories for the Horizontal Refer- 
ence Frame task. 


Canadian Readiness Test 
The CRT consists of seven subscales: 

1. Technical Language of Literacy. Sixteen items designed to measure the child’s 
knowledge of technical linguistic terms such as “letter,” “word,” etc. For ex- 
ample, the tester says, “Find each thing which is a ‘letter’.” The child draws a 
circle around all the “letters” found in an indicated row of symbols. 

2. Word Matching. Twenty-one items in which the child selects from a display of 
four words the one which “looks just like” another word printed separately and 
specified by the tester. 

3. Beginning Sounds. Nineteen items in which the child circles a picture whose 
name begins with the same initial phoneme as another word printed separately 
and specified by the tester. 

4. Semantics. Twelve items measuring the child’s ability to categorize. The task 
involves selecting from five pictures three which are conceptually identical. 

5. Visual Letter Recognition/Letter-Name Knowledge. Twenty-one items measur- 
ing the child’s ability to discriminate and recognize visually upper and lower 
case letters in relation to their letter-names. The tester names a letter and the 
child selects one of four alternatives. 
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6. Noun (learning rate on concrete words). Eighteen items to test the child’s abil- 
ity to select one of three printed words named by the tester. Prior to the test 
nine printed concrete nouns of varying length and configuration have been 
taught. 

7. Non-noun (learning rate on abstract words). Eighteen items to test the child’s 
ability to select one of three printed words named by the tester. Prior to the 
test, nine printed abstract non-noun words of varying length and configuration 
have been taught. 


The CRT was administered at the start of the Grade One year during the 
middle of September by each classroom teacher. Testing periods varied in length 
from 30 to 45 minutes. No more than three subscales were administered during any 
one testing session. Kuder-Richardson reliability coefficients of the seven subscales 
ranged from .78 to .95 with the single most highly reliable measure being the test of 
Visual Letter Recognition/Letter- Name Knowledge. : 


Correlates 


Reading Achievement 


Four parts of the Stanford Achievement Test (Madden, Gardner, Rudman, 
Karlsen & Merwin, 1972) were administered at the end of Grade One to measure 
reading achievement. The reading subscales used were the Vocabulary, Reading 
Part A (word reading), Reading Part B (reading and comprehension) and Word 
Study Skills. The Stanford Achievement Test was administered and scored during 
May of the Grade One year by the classroom teachers according to the procedures 
in the test manual. 


Mathematics Achievement 


A school district-developed mathematics test battery, designed to measure 
mathematics achievement, was administered at the same time as the reading 
achievement tests. The mathematics battery consisted of four subtests. 


1. Basic Facts and Computation. Thirty addition and subtraction items. 


2. Numeration. Thirty-four items involving counting, associating numeral names 
and numerals, place value concepts, missing addend addition, inequality 
symbols, and multiplication basic facts. . 

3. Problem Solving. Fifteen word problems, predominantly requiring addition or 
subtraction. 

4, Mathematical Concepts. Fourteen questions relating to money, geometry, time 
and comparative vocabulary. 

Kuder-Richardson internal reliability coefficients ranged from .80 to .87 for the 

subscales. Content validity of the battery was supported through the correspond- 


ence of the items with the Grade One mathematics curriculum used in the school 
district. 


Results 


Correlational Analyses 


Table 1 shows the intercorrelation matrix for the variables. Strong correlations 
can be seen between Solid, Liquid, and Length Conservation tasks, and within the 
CRT subscales, the mathematics battery, and the reading achievement subtests. 
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A series of canonical correlations was conducted in order to test the strength of 
the relationships between various sets of the variables. When the CRT subscales 
were taken as one set of predictors and the mathematics achievement tests were 
taken as the second set, a significant interrelationship was found (F=3.849, 
df=28,250, p<.001). A similar result occurred when the reading achievement tests 
were the correlates instead of the mathematics tests. Again, a significant relation- 
ship was found between the CRT and the reading achievement variables (F= 2.609, 
df= 35,288, p<.001). 

In order to examine the impact of the addition of the Piagetian tasks to the 
predictive power of the CRT, a canonical correlation was conducted using the com- 
bined set of CRT and Piagetian scores as one group of predictors, and the mathe- 
matics achievement tests as the second group. A significant overall relationship was 
again found (F=2.75, df=52,246, p<.001). Because of this significant overall cor- 
relation an examination of the contribution of each of the subscales was justified 
and a series of univariate F-tests with (4,75) df was conducted. Each of the seven 
CRT subscales was significantly involved in the overall correlation (p<.001) but 
only two of the Piagetian tasks were significantly involved—Number Conservation 
(p<.001) and Seriation (p<.01). 


A significant overall correlation was also found between the CRT-Piaget scores 
as one set of predictors and reading achievement scores as the second set (F=1.85, 
df=65,297, p<.001). Again because of this overall significance, an inspection of 
each of the series of univariate F-tests was justified. As before, all seven of the 
CRT subscales were significantly involved as predictors of reading achievement 
(Word Matching, p=.015; Semantics, p=.03; remaining subscales, p<.01) but only 
two Piagetian tasks were significantly involved in the association with reading 
achievement. These tasks were Number Conservation (p<.01) and Liquid Conser- 
vation (p<.05). 


In summary, various canonical correlations showed that the seven reading read- 
iness variables were consistently important in the interpretation of the canonical 
variables. Among the Piagetian tasks, Conservation of Number was the variable 
most often involved in the interpretation. 


Factor Analysis 


A final investigation involved the relationship between the underlying theoreti- 
cal constructs which the CRT and the set of six Piagetian tasks were meant to rep- 
resent. A principal components factor analysis of the intercorrelations between 
these two sets of measures was utilized to explore the pattern of underlying rela- 
tionships. Table 2 shows the factor matrix after varimax rotation. It can be seen 
that the first factor, which accounts for 30% of the variance, is defined by the 
CRT. None of the Piagetian tasks is involved in this factor. Three of the Conser- 
vation tasks (Solid, Liquid, and Length) define the second factor, which accounts 
for 15% of the variance. None of the CRT variables is involved with this cluster. 
The third factor, accounting for 10.6% of the variance, is defined by Number Con- 
servation, Seriation, Word Matching, Beginning Sounds, and Semantics. The 
fourth factor, accounting for 8% of the variance, is defined by a single variable— 
Horizontality. 


Discussion 


The results support the predictive validity of the CRT for both subsequent 
reading achievement and mathematics achievement. 


Predictive Validity 


TABLE 2 


CRT and Selected Piagetian Tasks: 
Results of Principal Components Analysis after Varimax Rotation 


Variables Factor 1 Factor 2 Factor 3 Factor 4 
Solid Cons. 851 

Number Cons. ot 

Liquid Cons. .850 

Length Cons. .728 

Horizon .916 
Seriation 657 

Tech. Lang. 042 

Letters .822 

Word Match 41 

Begin. Sds. .526 585 

Semantics .588 

Non-noun 872 

Noun .830 

Kigenvalues 3.94 sis) 1.38 1.04 


Within the CRT battery, four of the seven subscales were the most powerful 
predictors. These four are Beginning Sound, Semantics, Noun, and Non-Noun. In 
the intial validation study of the CRT, (Evanechko, et al., 1973), the Letter Recog- 
nition subscale was the best single predictor of reading achievement, followed by 
Noun, Semantics, and Beginning Sounds. The letter Recognition subscale no longer 
serves as a significant predictor. A distinct ceiling effect was noted for this subscale 
(X=18.23, S.D.=3.4), perhaps indicating many children are now thoroughly ex- 
posed to letters before they start Grade One. Beginning Sounds, Semantics, and 
Noun learning rate remain as useful predictors of reading achievement. 


The six selected Piagetian tasks, as indices of cognitive development, appear to 
be measuring three groups of competencies as indicated by the factor analysis, but 
contribute relatively little to the prediction of either reading or mathematics 
achievement when used in conjunction with the CRT. It is especially interesting 
that the CRT provides a better prediction of mathematics achievement than do the 
Piagetian tasks, given the strong theoretical rationale between the cognitive devel- 
opment paradigm and the child’s development of mathematical concepts. These 
findings parallel Nelson’s (1980) observations and have implications for mathemat- 
ics education research and methodologies typically prescribed for early childhood 
mathematics, as both are frequently established around a cognitive development 
framework. 

The relationship between the constructs underlying the CRT and those 
underlying the set of six Piagetian measures is interesting. It would seem that the 
clear separation between the CRT variables and four of the six Piagetian tasks 
would mean those four tasks would add extra information to the overall association 
between the predictor variables and the post-Grade One scores. However, the other 
two Piagetian tasks, Number Conservation and Seriation, are the ones which were 
significantly involved in the overall correlations between predictors and 
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achievement. Although the four Piagetian tasks (Solid, Liquid, and Length Conser- 
vation, and Horizontality) were chosen as providing a reasonable theoretical fit for 
cognitive skills needed in Grade One reading and mathematics, they were seen to 
measure something different from that which was measured by the CRT. Was the 
rationale for the selection of the specific Piagetian tasks faulty? As well as describ- 
ing something different from the skills measured by the CRT, the dimensions de- 
scribed by the four Piagetian tasks do not seem to be measured by post-Grade One 
achievement tests. What are the implications of this lack of congruence between 
what is seen to be important in the Grade One curriculum, with respect to year-end 
testing, and what is involved cognitively in the various Piagetian tasks? As these 
tasks seem to be tapping something that is not reflected in post-Grade One reading 
and mathematics achievement scores, does this imply that we are neglecting to 
evaluate some important aspects of children’s cognitive growth within our typical 
school achievement batteries? Or, alternatively, might these aspects of cognitive 
development be relatively unimportant in the actual learning of mathematics and 
reading? 

One of the dimensions identified in the factor analysis, Factor 3, is a complicat- 
ed factor which needs closer inspection. Variables with interpretable loadings on 
this factor include ones from both batteries which seemed most involved in the 
overall prediction of reading and mathematics achievement—Number Conservation 
and Seriation from the Piagetian set and Beginning Sounds and Semantics from 
the CRT. What is it that these measures have in common and why are they espe- 
cially important with respect to subsequent school achievement? Subsequent study 
is needed to see if this result can be replicated but more importantly to see if it can 
be integrated into a theoretical framework which describes how children learn. 

Taken together, the CRT and the set of six Piagetian tasks do form an impres- 
sive prediction of subsequent achievement, accounting for 56% of the variance in 
reading achievement and 65% of the variance in mathematics achievement. How- 
ever, the Piagetian tasks contribute relatively little to this overall prediction, and 
add considerable logistic demands involved in the administration of the individual 
interviews. The CRT tests, used without the Piagetian measures, appear well 
supported as reliable and valid predictors of Grade One achievement in both read- 
ing and mathematics. 


Note 


alt Inquiries should be sent to Dr. Betty Collis, Department of Psychological Foundations, Faculty of Education, University of 
Victoria, P.O. Box 1700, Victoria, B.C., Canada, V8W 2Y2. 
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Item responses on nine WISC-R subtests (Object Assembly, Coding, Mazes excluded) 
from 273 Inuit children aged 7-0 to 8-11 years, 10-0 to 11-11 years, and 13-0 to 14-11 
years in the Kitikmeot and Keewatin districts of Canada’s Northwest Territories 
were examined with respect to item-total correlations and relative item difficulty. 
Overall, the WISC-R was found to be a reasonably internally consistent measure of 
the Wechsler intelligence constructs for these children. The majority of test items 
demonstrated significant correlations with the full test score, and also showed ade- 
quate and appropriate discrimination between the Wechsler verbal and performance 
constructs. While there was little evidence to suggest a need for major item revisions 
or reordering, the finding that fully one-third of the test items fell within the extreme 
deciles of relative difficulty for these children and, therefore, failed to add signift- 
cantly to test variance is of some concern. The Verbal Scale tended to contain more 
items that were too difficult, whereas the Performance Scale tended to contain more 
items that were too easy. The implications of these findings are discussed with re- 
spect to the WISC-R’s use as a global intelligence measure with Inuit children. 


Over the last quarter-century, the Wechsler Intelligence Scale for Children 
(WISC) and its 1974 revision, the WISC-R, have found increasing acceptance with 
Canadian psychologists. Today the WISC-R is clearly the “test of choice” for the 
assessment of the intellectual functioning of individual Canadian children. How- 
ever, along with this increased popularity has also come increasing concern with re- 
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gard to whether using an American-normed psychometric instrument results in sig- 
nificant cultural bias against Canadian majority or ethnic and racial minority chil- 
dren. 


A number of studies have examined the effects of “Canadianizing” WISC 
Information items (Spreen & Tryk, 1970), and WISC-R Information and Compre- 
hension items (Vernon, 1974, 1976, 1977). Recently, Marx (1984) compared the 
item performance of two samples of Canadian children administered Vernon’s 
(1977) modified WISC-R Information subtest to item difficulty data obtained for 
Information from the 1974 WISC-R standardization. Marx concluded that 
“Vernon’s suggested changes to the WISC-R Information subtest resulted in higher 
raw scores for this subtest for both clinical and nonclinical samples” (Marx, 1984, 
p. 34). Such results are suggestive of some WISC-R cultural bias. But, as yet, there 
is no research comparing WISC-R item performance of a normative sample of Ca- 
nadian children to the WISC-R standardization data using a standard administra- 
tion, nor is there research directly comparing performance on the “Canadianized” 
WISC-R to the standard WISC-R in a normative Canadian sample. 


The majority of studies examining the standard WISC-R performance of Cana- 
dian majority children have focused on more global subtest and Scale score pat- 
terns of differential responding rather than on the analysis of individual item 
responses (e.g., Dash, Dennis, Mueller, Mancini, Snart, & Short, 1983; Peters, 1976; 
Wersh & Briere, 1981). Such studies have failed to clearly establish WISC-R per- 
formance differences attributable to cultural differences between Canadian normal 
and clinic-referred children, and the American standardization sample (see Dash et 
al., 1983). While Canadian majority children may demonstrate some increased dif- 
ficulty with a few specific WISC-R items, their overall performance does not ap- 
pear significantly affected. 

But what of Canada’s ethnic and racial minority children? Might not the 
WISC-R and its norms be significantly biased against these groups? Although the 
relatively abundant American research literature on WISC-R bias with American 
minorities (e.g., Mishra, 1982; Sandoval, 1979; Sandoval & Miille, 1980; Sandoval, 
Zimmerman, & Woo-Sam, 1983) has generally found little or no overall test bias, 
one cannot simply generalize such findings to the Canadian situation, where the re- 
search has focused almost exclusively on the indigenous Native population. 

Although rather sparse and fragmentary, research and ancedotal data support 
the generalization that Canadian Native children’s performance on tasks assessing 
visual-perceptual and nonverbal analytic skills compares quite favorably to that of 

Canadian majority children, but compares poorly on tasks that assess verbal-con- 
ceptual and linguistic skills (e.g., Kleinfield, 1973; MacArthur, 1968; St. John, 
Krichev, & Bauman, 1976; Taylor & Skanes, 1976). 

Seyfort, Spreen, and Lahmer (1980) evaluated WISC-R item responses and dif- 
ficulty gradients in a sample of 177 Native children aged 6 to 18 years living on re- 
serves and attending regular schools in Southwestern British Columbia. Single item 
response frequency and difficulty gradients were analyzed by coding items as 0 for 
incorrect and 1 for correct responses; partial or qualitative scores were not recorded 
separately. Their data confirmed previous findings insofar as their sample dis- 
played average visual-perceptual analysis skills and relatively poor verbal skills. 

But of more significance, Seyfort et al. found that only approximately 60% of the 
157 test items answered correctly by more than one child correlated significantly 
(p<.05) with own subtest total score. This apparent lack of “validity” (p. 18), as 
indexed by test-total correlations for many of the WISC-R items, suggests that 
either these Native children were having some difficulty in properly understanding 
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many of the items, or that many WISC-R items were tapping rather different skills 
or abilities than in the standardization sample. Only approximately 25% of the 
items answered correctly by at least one child were of moderate difficulty, whereas 
approximately 43% of such scorable items fell within the extreme deciles of rela- 
tive difficulty. However, any analysis of WISC-R item difficulty that collapses data 
across a broad age range will find a rather large number of items to have a mean 
difficulty index outside the “desirable” range of .30 to .70 suggested by Anastasi 
(1976) because no single item is designed to be equally difficult to children for all 
ages. A statement regarding inappropriate difficulty levels for specific WISC-R 
items requires item response data from less broad age groups. 


The purpose of the present study was to examine WISC-R item-total correla- 
tions and relative item difficulty for three age pools of Canadian Inuit children 
drawn from a more complete norming sample. It was believed that such an exami- 
nation of the item response and difficulty gradient of the WISC-R would facilitate 
more accurate and meaningful assessment of Inuit children in Canada’s Northwest 
Territories. 

Method 
Subjects 

The subjects used for the present analysis were all children (N=273; 137 
male + 136 female) aged 7-0 to 8-11 years, 10-0 to 11-11 years, and 13-0 to 14-11 
years included within a larger sample of children assessed as part of a psychometric 
test norming project in Canada’s Northwest Territories in 1981 (see Mulcahy & 
Watters, 1983). The original normative sample consisted of 366 students (185 
males, 181 females) from 13 villages in the Kitikmeot and Keewatin districts of the 
Northwest Territories. These 366 children were randomly selected from the com- 
plete school registry after stratification by age and sex. The population of these re- 
gions is approximately 90% Inuit, 8.5% Caucasian, and 1.5% Dene. This normative 
sample, the sampling procedures used and the testing procedures are described in 
detail by Mulcahy & Watters. 


Procedures 


These data were collected between October 15 and November 15, 1981. The 
WISC-R was administered by nine trained psychometricians (4 male, 5 female). All 
testers received extra training with respect to the assessment of northern Native 
children prior to travelling to the Territories. All subjects were individually tested 
in their school location with some testing occurring on non-school days or early 
evenings. 

A total of seven WISC-R items in the Information (5), Similarities (1), and 
Comprehension (1) subtests were modified to better reflect Canadian content (see 
Vernon, 1974, 1976, 1977). Also, in consultation with a practicing psychologist in 
the Northwest Territories with extensive experience in administering the WISC-R 
to Native children, it was decided to modify slightly the wording of three further 
items (one in Similarities and two in Arithmetic) to enhance understanding. Minor 
modifications to the query procedures outlined in the WISC-R Manual 
(Wechsler, 1974) were also made for the Similarities, Vocabulary, and Picture Com- 


pletion subtests.’ In all other respects, the WISC-R was administered in the stand- 
ard manner. 


Scoring Reliability. Testers were paired and each checked the other’s WISC-R 
protocols for correct item scoring and summation. As well, every protocol was fur- 
ther checked by the second author for correct scoring. Since receptive and expres- 
sive language deficiency is known to interfere with WISC-R performance, an at- 
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tempt was made to test only those selected children who, according to teachers and 
testers, were “sufficiently proficient” in the English language. All testers were also 
instructed to record on each test protocol any concerns that a lack of receptive or 
expressive language skill might have interfered with adequate performance. A total 
of 32 such protocols were determined invalid upon closer scrutiny by the authors 
and were deleted from the norming sample. Thus a total of 366 WISC-R protocols 
were retained for norming analysis. For the present item analyses, the protocols of 
all children aged 7-0 to 8-11 years were pooled into one group (G1; n=71), all chil- 
dren aged 10-0 to 11-11 years were pooled into another group (G2; n=101), and all 
children aged 13-0 to 14-11 were pooled into a third group (G3; n=96). 


Data Analysis 


Since ANOVAs on subtest total scores by age and sex carried out on the origi- 
nal norming sample (see Mulcahy & Watters’) revealed no significant age by sex in- 
teractions or sex main effects, it was decided to collapse data across sex for all item 
analyses. 


Single item response frequencies and item difficulty gradients were generated 
by coding all item responses as either 0 for incorrect or 1 for correct. Items were 
considered correctly answered if any credit could be allowed. While this procedure 
functions to reduce response variance and lower item-total correlations, it was fol- 
lowed to increase the ordinal consistency of test items and to simplify data analysis 
(Seyfort et al., 1980). No error is created by such a scoring procedure for the 
Information, Picture Completion, and Arithmetic subtests, and only a relatively 
small amount of error is created for the Similarities, Vocabulary, Comprehension, 
and Digit Span subtests. Significantly greater error is introduced for such subtests 
as Picture Arrangement, Block Design, Object Assembly and Mazes wherein chil- 
dren can obtain scores greater than two for items. But for Picture Arrangement 
and Block Design only variance due to speed of performance is completely lost by 
using a “0/1” scoring procedure. It is only with Object Assembly and Mazes that too 
much variance due to “correctness of performance” is lost to permit a reasonably 
accurate analysis of relative item difficulties. Therefore, these latter two subtests 
were excluded from the present analysis. The Coding subtest was also excluded 
from analysis because it does not contain an increasing item difficulty component. 
Item analysis, therefore, was carried out on only the 191 items within the remain- 
ing six Verbal subtests (142 items) and three Performance subtests (49 items). 


A count of the number of children in each age pool correctly answering each 
test item and a calculation of the relative proportion of each pool correctly answer- 
ing each test item were performed. Test items in each subtest were ranked in order 
of relative difficulty on the basis of the proportion of children in each age pool who 
answered the item correctly. These data were used for the item difficulty analyses. 
The Spearman rho formula was used to calculate the rank-order correlation be- 
tween the standardized subtest item order and a revised order based on obtained 
difficulty data for each subtest at each of the three age levels. The N used for each 
of these calculations was the total number of subtest items answered by at least 
one child within each age pool. Such items will be termed “scorable items” for the 
purposes of this paper. 

Item scores (0/1) were also used to generate a set of three Product-Moment cor- 
relations (PM) for each of the 191 items being analyzed. Each test item was corre- 
lated with a “Verbal Score” consisting of the sum of Information, Similarities, 
Arithmetic, Vocabulary and Comprehension subtest scores; with a “Performance 
Score” consisting of the sum of Picture Completion, Picture Arrangement, and 
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Block Design subtest scores; and a “Full Score” consisting of the sum of the Verbal 
and Performance Scores. These item-total correlation coefficients were interpreted 
as discrimination indices related to test construct validity at the item, rather than 
subtest, level (see Vance, Gaynor, & Coleman, 1977; Seyfort et al., 1980). 


Results and Discussion 
Subtest Item Difficulty and Validity 


Tables 1 through 4 present the results of the item analysis on each of nine 
WISC-R subtests for each of three age pools. These results have been tabulated at 
each age level into six columns as follows: 1) number of subjects who correctly an- 
swered the item (f), 2) the relative proportion of subjects who correctly answered 
the item (P), 3) an obtained item order based on relative item difficulty (R), 4) the 
PM correlation between the subtest item and the Verbal Score (rVS), 5) the PM 
correlation between the item and the Performance Score (rPS), and 6) the PM cor- 
relation between the item and the Full Score (rFS). Items are tabulated in the 
same sequence as given within the WISC-R Manual (Wechsler, 1974). Spearman 
rank-order correlations between standardized item order and revised orders based 
on obtained data are also presented at the bottom of each table. For reasons of par- 
simony, only scorable items in any age pool are tabulated, as can be determined 
with reference to the tables. 

Two aspects of test items will be discussed for each of the nine subtests at each 
of the three age levels: item validity and item relative difficulty. Relative item diffi- 
culty is indexed by the proportion of subjects taking a test who correctly answer 
the item. Items correctly answered by between 30 and 70 percent of subjects are 
considered moderately difficult and desirable from a test construction point of 
view. Correctly answered items falling within the extreme deciles of difficulty (P=0 
to .10, or .90 to 1.00) are either too easy or difficult for the majority of children and 
fail to add significantly to total test variance (see Anastasi, 1976). Item validity 
may be inferred from the degree of relationship between item responses and some 
criterion of performance within or external to the test (see Anastasi, 1976). For the 
present analysis, a traditional item difficulty index was generated by determining 
the relative proportion of children in each age pool who answered an item cor- 
rectly, and three separate indices of item validity were generated by correlating 
responses to each item with Verbal score, Performance Score, and Full Score. 


Overview 


For ages 7-0 to 8-11 (G1), of the 191 items administered, only 130 (83 Verbal, 
47 Performance) were scorable. The corresponding number for ages 10-0 to 11-11 
(G2) are 155 items (106 Verbal, 49 Performance), and 13-0 to 14-11 (G3), 170 items 
(121 Verbal, 49 Performance). For G1, only 24% (G2: 24%; G3: 20%) of Verbal 
items and 438% (G2: 24%; G3: 10%) of Performance items are of moderate diffi- 
culty (P=.30 to .70). For G1, for the Verbal Scale, as many as 34% (G2: 31%; 
G3: 31%) of scorable items fall within the lower decile of difficulty (too hard), 
whereas only 5% (G2: 13%; G3: 21%) fall within the upper decile (too easy). In 
contrast, only 15% (G2: 8%; G3: 2%) of Performance Scale items fall within the 
lower decile and 17% (G2: 31%; G3: 49%) fall within the upper decile. These items 
failed to add significantly to the total test variance for these Inuit children. 


For all three groups, the majority of scorable test items demonstrate a pattern 
of item-total correlations consistent with the WISC-R’s construction. For the 
Verbal Scale, for G1, 72% (G2: 69%; G3: 69%) of scorable subtest items correlate 
significantly with the Verbal Score while only 34% (G2: 31%; G3: 25%) correlate 
significantly with the Performance Score. Similarly, 70% (G2: 73%; G3: 65%) of 
scorable Performance Scale items correlate significantly with the Performance 
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Score and only 30% (G2: 31%; G3: 22%) correlate significantly with the Verbal 
Score. A total of 68% of all 130 (G2: 67% of 155; G3: 59% of 170) scorable test 
items correlate significantly with the Full Score. 


Information Subtest. Examination of Table la for the 7-8 year pool (G1) re- 
veals that 73% (G2: 81%; G3: 79%) of scorable items correlate significantly with 
the Verbal Score. Only two items (G2: 2 items; G3: 3 items) also correlate signifi- 
cantly with the Performance Score. This pattern suggests that Information is a rea- 
sonably good measure of the WISC-R verbal construct for each age group. Table 1a 
also reveals that Information is only moderately well ordered at age 7-8 with re- 
spect to obtained item difficulty (rho=.84; G2: rho=.94; G3: rho=.96), although 
only one item, #8 (“Name . . . after March.”), is misplaced by more than two ranks 
from its standard placement (easier). For G2, four scorable items are misplaced by 
more than two rank-positions (all easier); item #8, #24 (“How tall .. . Canadian 
man?”), and #19 (“What are the names .. . oceans . . . Canada?”), modified to be 
more Canadian in content, and #25 (“What .. . barometer?”). For G3, only three 
scorable items are misplaced by more than two ranks, item #19 (easier), #12 
(“Who ... America?”), an unchanged American-content item (harder), and #20 
(“How many .. . ton?”; harder). One scorable Information item is of moderate diffi- 
culty for G1 (G2: 29%; G3: 13%) while 55% (G2: 62%; G3: 63%) fall within the ex- 
treme deciles of difficulty. A sharp increase in item difficulty is apparent for Gl 
beginning with item #4 (G2: #10; G3: #12). 


Similarities Subtest: Table 1b reveals that, for G1, 83% (G2: 92%; G3: 88%) of 
the scorable Similarities items correlate significantly with the Verbal Score. How- 
ever, fully 67% (G2: 69%; G3: 47%) of scorable items also show significant positive 
correlation with the Performance Score and, therefore, may be considered some- 
what weak in scale discrimination. At 7-8, there is reasonably good consistency in 
obtained item placement compared to the standardized item order (rho=.93); with 
only one scorable item misplaced by more than two ranks from standard (easier), 
#15 (“first-last”). The obtained item order for G2 is only slightly discrepant from 
the standardized order (rho=.89; G3: rho=.94); with two scorable items (G3: only 
item #3) misplaced by more than two ranks (harder), #3 (“shirt-hat”) and #6 
(“beer-wine”). For G1, one scorable item is of moderate difficulty (G2: 62%; 
G3: 41%), while fully 75% (G2: 31%; G3 30%) of scorable items fall within the 
lower decile of difficulty (too hard). No items fall within the upper decile (too easy) 
for any age group. For all 3 groups, a dramatic increase in item difficulty is appar- 
ent starting with item #10 (“pound-yard”). 

Arithmetic Subtest. Table 2a shows that, for G1, 71% (G2: 56%; G3: 38%) of 
the scorable arithmetic items correlate significantly with the Verbal Score, but 
43% (G2: 50%; G3: 19%) of Arithmetic items also correlate significantly with the 
Performance Score. Thus Arithmetic appears to be a rather poor discriminator be- 
tween the WISC-R verbal and nonverbal constructs, particularly at age 10-11, and 
at 13-14, arithmetic does not appear to be a particularly strong measure of any of 
the three WISC-R scales or constructs. Arithmetic items are quite well ordered 
with respect to obtained relative difficulty for all groups (G1: rho=.98; 
G2: rho=.997; G3: rho=.97) with no items misplaced by more than two ranks from 
standard for G1 and 2, and only item #6 significantly misplaced from standard for 
G3. For G1, arithmetic contains a relative large proportion of scorable items (38% ) 
(G2: 1 item; G3: 3 items) of moderate difficulty, but another 38% (G2: 67%; 
G3: 50%) of scorable items also fall within the extreme deciles of difficulty and fail 
to add greatly to total subtest variance. A moderately large increase in difficulty 
occurs for G1 with item #10 (G2: #15). 
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Vocabulary Subtest. Examination of Table 2b reveals that, for G1, 93% 
(G2: 100%; G3: 96%) of the scorable Vocabulary items correlate significantly with 
the Verbal Score, but 33% (G2: 2 items; G3: 26%) of scorable Vocabulary items 
also correlate significantly with the Performance Score. Item rankings based on ob- 
tained relative item difficulty are close to standardized placement for Gl 
(rho=.95); with only one item deviating from standard by more than two ranks 
(harder), #9 (“thief”). Vocabulary appears to be a very strong and efficient discrim- 
inator between verbal and nonverbal skills for age 10-11 Inuit children. Vocabulary 
items are also relatively well ordered for this group (rho=.95); with only two 
scorable items misplaced by more than two ranks from standard, #9 (harder) and 
#17 (“nuisance”; easier). For G3, rho=.96 and four scorable items are misplaced by 
more than two ranks, #20 (“migrate”), #23 (“mantis”), #25 (“belfry”), all easier, 
and #18 (“harder”; harder). For G1, only 27% (G2: 22%; G3: 22%) of scorable 
Vocabulary items are of moderate difficulty, and another 27% (G2: 28%; G3: 22%) 
fall within the lower decile of difficulty (too hard). No items fall within the upper 
decile for G1 and G2. A large increase in relative difficulty is apparent with item 
#9 for G1. 


Comprehension Subtest. Table 3a shows that, for G1, all of the scorable Com- 
prehension items (G2: 67%; G3: 65%) correlate significantly with the Verbal Score, 
and only two items (13%) also correlate significantly (G2: 18%; G3: 18%) with the 
Performance Score, giving the Comprehension subtest relatively good overall scale 
discrimination. Table 3a also reveals quite a bit of inconsistency in obtained item 
placement compared to the standardized placement (rho=.77) at 7-8; with six 
scorable items misplaced by more than two ranks from standard, #8 (“Why .. . li- 
cence plates?”), #10 (“Why .. . on letters?”), #11 (“Why . . . meat packing 
plants?”), all harder, #14 (“How are .. . hardcover books?”), #15 (“Why . . . prom- 
ise be kept?”), and #16 (“Why .. . in cloth?”), all easier. Comprehension items are 
moderately well ordered with respect to difficulty for G2 (rho .87; G3: rho=.93); 
with four items (G3: only 2 items, #11 + #12) misplaced by more than two ranks 
from standard, #11 and #12 (“Why ... street beggar?”), both harder, and #15 and 
#16 (both easier). Approximately 38% (G2: 2 items; G3: 2 items) of scorable Com- 
prehension items are of moderate difficulty for G1 while another 56% (G2: 2 items) 
fall within the lower decile of difficulty (too difficult). No items fall within the 
upper decile of difficulty for G1 and G2. For G3, 29% of scorable items fall within 
the extreme deciles of difficulty. A moderate increase in item difficulty is apparent 
beginning with item #7 (“In what ways . . . of wood?”), for all three age pools. 


Digit Span Subtest. Table 3b reveals that, for G1, only 50% (G2: 1 item; G3: 2 
items) of the Digits Forward scorable items correlate significantly with the Verbal 
Score, and only one item (G2 and G3: 0 items) correlates significantly with the 
Performance Score, although the majority of these items (G2: only 2 items; 
G3: only 3 items) do demonstrate moderate positive correlation with the Full Score 
at age 7-8. These item-total correlate patterns suggest that Digits Forward is not a 
particularly good measure of any of the WISC-R constructs with the Inuit. Digits 
Forward items are quite well ordered with respect to obtained item difficulty 
(G1: rho=.93; G2: rho=.96; G3: rho=.98); with no items displaced by more than 
two ranks from standard. For G1, only one item is of moderate difficulty, whereas 
38% of scorable items fall within the lower decile of difficulty (too hard). No items 
appear too easy. A dramatic increase in relative difficulty is apparent after the sec- 
ond item-pair (5 digits). For G2, only one item-pair (#5 & 6) is of moderate diffi- 
culty while the first two pairs (#1 & 2, 3 & 4) fall within the upper decile of diffi- 
culty (too easy) and the fifth item-pair (#9 & 10) falls within the lower decile (too 
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hard). Only two items are of moderate difficulty for G3 and fully 69% fall within 
the extreme deciles of difficulty. For Digits Backward (Table 3c), for G1, the ma- 
jority of these items (57%), (G2: 0%; G3: 1 item) correlate significantly with the 
Verbal Score; although 43% (G2: 3 items; G3: 1 item) also correlate significantly 
with the Performance Score. Thus, Digits Backward is also a relatively weak dis- 
criminator between the verbal and performance constructs of the WISC-R. As with 
Digits Forward, Digits Backward shows quite a bit of consistency in item order 
(G1: rho .96; G2: rho=.95; G3: rho=.99), with no item misplaced by more than two 
ranks. Only two items are moderately difficult for G1, & G2 (G3: 1 item) and an- 
other two fall within the lower decile of difficulty (too hard), for G1, while no items 
are too easy. A large increase in difficulty is apparent after the second pair of items 
(4 digits) for G1. The first item-pair (#1 & 2) is too easy and the fifth item-pair 
(#9 & 10) is too difficult for G2; for G3, only two items (#5 & 6) are of moderate 
difficulty whereas 45% fall within the extreme deciles of relative difficulty. 


Picture Completion Subtest. An examination of Table 4a reveals that, for G1, 
58% (G2: 54%; G3: 46%) of the scorable items correlate significantly with the 
Performance Score, and only 19% (G2: 15%; G3: 15%) correlate significantly with 
the Verbal Score, suggesting moderately good discrimination for the Performance 
Scale for G1 and 2, whereas it is not a particularly strong measure of any of the 
three WISC-R constructs for the older Inuit. The Picture Completion subtest also 
is quite well ordered with respect to item difficulty for G1 (rho=.96; G2: rho=.88; 
G3: rho=.95) with only five of 26 items misplaced by more than two ranks from 
standard for G1: #12 (“man”), #13 (“door”) and #18 (“scissors”) are easier, and #6 
(“mirror”) and #10 (“dresser”) are more difficult. For G2, however, 13 items are 
misplaced by more than two rank positions from standard: #11 (“belt”), #12, 18, 
18, 24 (“telephone”), #25 (“profile”), all easier, and #5 (“cat”), #8 (“elephant”), #9 
(“ladder”), #10, 15 (“girl running”), #21 (“cow”), and #23 (“house”), all more diffi- 
cult. For G3, Picture Completion is really only relatively well ordered for the first 
few items; 10 of the 26 scorable subtest items are misplaced by more than two 
ranks from standard. Items #8, 10, 15, 21, and 22 (“thermometer ”) are more diffi- 
cult; items #11, 14 (“playing card”), #17 (“boy”), #24, and 25 are easier. For Gl, 
Picture Completion includes a relatively large portion of moderately difficult items 
(42%; G2: 19%; G8: 1 item) and items within the extreme deciles of difficulty 
(388%; G2: 43%). For G3, 54% are too easy and only 1 item is too hard. Thus, this 
subtest includes a relatively large portion of items (15%) that are clearly too easy 
for even the very young Inuit children (G1). A moderate increase in difficulty is ap- 
parent beginning with item #20 for G1; item #21 (“cow”), for G2 and G3. 


Picture Arrangement Subtest. Table 4b reveals that, for G1, fully 92% 
(G2: 75%; G8: 75%) of the scorable items correlate significantly with the 
Performance Score, although 50% (G2: 25%; G3: 0%) of scorable items also corre- 
late significantly with the Verbal Score; suggesting that for the young Inuit (G1) 
this subtest contains a relatively large “verbal” component to performance. Picture 
Arrangement appears to be a reasonably good measure of the WISC-R performance 
construct for the older groups. This subtest is also reasonably well ordered with re- 
spect to relative difficulty for the children (G1: rho=.90; G2: rho=.97; 
G3: rho=.93); with only two items misplaced by more than two ranks from stand- 
ard for G1 (G2=0 items; G3: item #9); #9 (“chase”) is easier, while #6 (“rush”) is 
more difficult. Fully 50% of Picture Arrangement items are of moderate difficulty 
for G1 (G2: 42%; G8: 3 items only) and two items (#1 & 2) are clearly too easy 
(G2: items #1, 2, 3; G3: items 1 to 4). 


Block Design Subtest. Table 4c shows that, for G1, 78% (G2: 73%; G3: 64%) of 
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the scorable Block Design items correlate significantly with the Performance Score, 
whereas only 33% (G2: 27%; G3: 27%) of scorable items correlate significantly 
with the Verbal Score. Block Design appears to be a relatively good measure of the 
WISC-R performance construct with the children. The obtained item order for this 
subtest is also highly consistent with the standardized order (G1: rho=.98; 
G2: rho=.99; G3: rho=.99); with no items misplaced by more than two ranks. 
While, for G1, 33% (G2: 2 items; G3: 3 items) of items fall within the moderate dif- 
ficulty range, another 33% (G2: 45%) fall within the extreme deciles of difficulty. 
For G1, a moderate increase in difficulty is apparent with item #4 (no block 
outlines in model), and again with item #7. For G2, Block Design shows a relatively 
stable and modest gradient of increasing item difficulty, while for G3, a sharp in- 
crease is apparent at item #10 while the first six items are clearly too easy. 


Across Age Trends 


Overall, a number of trends are apparent in WISC-R item-total correlations, 
relative difficulty, and ordering across age for the Inuit sample.*,° With respect to 
item-total correlations, the proportion of scorable Verbal Scale items that correlate 
significantly with the Verbal Score appears quite stable at approximately 69-73% 
across age. Similarly, the proportion of scorable Performance Scale items that cor- 
relate significantly with the Performance Score also stays relatively constant at ap- 
proximately 65-73% across age. However, the proportion of scorable Verbal Scale 
and Performance Scale items that correlate strongly (>.34) with both the Verbal 
Score and Performance Score decreases steadily with increasing age. Thus, gener- 
ally, WISC-R test items appear to tend toward increasing differentiation with re- 
spect to one or the other of the WISC-R scales with increasing examinee age. With 
respect to item-total test correlation, the majority of test items correlate signifi- 
cantly with Full Score, but there is a general trend toward a slight decrease in 
number of scorable items with significant item by Full Score correlations with in- 
creasing age—a trend shown most strongly in Arithmetic, Digit Span, and Picture 
Completion. 

Overall, the item ordering for subtests as given in the WISC-R Manual 
(Wechsler, 1974) is relatively consistent with an order based on presently obtained 
relative item difficulties for all three age pools. Only 11-16% of all scorable test 
items are misplaced by more than two ranks from standard in any age group. The 
two subtests that consistently contain the greatest relative proportion of signifi- 
cantly misplaced items are Picture Completion and Comprehension; whereas Arith- 
metic, Digit Span and Block Design contain no items that are significantly 
misplaced. Only 17 items (#8 in Information, #3 in Similarities, #9, 10 in 
Vocabulary, #11, 12, 15, 16 in Comprehension, and #10, 11, 12, 13, 15, 18, 21, 24, 
25 in Picture Completion) are significantly misplaced in the same direction for two 
or more of the three age groups. These items need to be further examined for pos- 
sible cultural bias and possibly modified, replaced, or reordered within their re- 
spective subtests for future use with Inuit children. 

Of possibly greater import to the use of the WISC-R with Inuit children is the 
finding that only 20-24% of Verbal Scale scorable items and 10-43% of 
Performance Scale scorable items fall within the moderate range of relative diffi- 
culty while fully 39-51% of Verbal Scale items and 32-51% of Performance Scale 
items fall within the extreme deciles of relative difficulty and fail to add signifi- 
cantly to test variance. The proportion of scorable items in the moderate range 
stays quite stable across age for the Verbal Scale, but decreases steadily with in- 
creasing age for the Performance Scale. For the Performance Scale, a sharp de- 
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crease in the proportion of extremely difficult items (15% to 2%) and a sharp in- 
crease in the proportion of too easy items (17% to 49%) is evident with increasing 
age. Thus it appears that the Performance Scale suffers from a relatively low per- 
formance ceiling with these children. 


Conclusions 


Overall, the WISC-R appears to be a reasonably consistent measure of the 
Wechsler verbal, performance, and global abilities constructs with Inuit children. 
The vast majority of test items demonstrated adequate and appropriate discrimi- 
nation between the Wechsler verbal and performance constructs. As with the origi- 
nal WISC-R standardization sample, the Arithmetic and Digit Span subtests ap- 
pear to measure some other, orthogonal, construct with the Inuit sample. This is 
particularly true of the older Inuit whose responses to these two subtests show par- 
ticularly weak relationships to their responses on the other seven subtests. Possibly 
this other construct is similar to the Freedom from Distractability construct 
suggested by Kaufman (1975) based on his factor analysis of the WISC-R 
standardization data, although a recent study by Stewart and Moely (1983) 
suggests that Arithmetic and Digit Span may be measuring quite different cogni- 
tive processes and, therefore, may not be a unitary third WISC-R construct. Little 
evidence was found to suggest the necessity for major item ordering changes or 
content revision beyond those already made to “Canadianize” the WISC-R; al- 
though a small number of items were consistently misplaced across age levels. 


Of somewhat greater concern is the present finding, similar to Seyfort et 
al., (1980), that only approximately one-third of scorable WISC-R items at any age 
were of moderate difficulty (P-value=.30 to .70) for the Inuit sample while roughly 
another third fell within the extreme deciles of relative difficulty and failed to add 
significantly to total test response variance. Although no similar item difficulty 
data has been published for the original WISC-R standardization sample to permit 
direct comparison, it would seem likely that these figures for the Inuit sample rep- 
resent a significant increase in overall WISC-R difficulty compared to that shown 
by majority children. If this is indeed the case, Canadian indigenous children would 
tend to be evaluated on the basis of fewer test responses and, therefore, WISC-R 
measurement reliability would be compromised to some extent. With the Inuit chil- 
dren, an overall increase in the relative difficulty of the WISC-R (particularly the 
Verbal Scale items) may be a function of any of the factors commonly given as rea- 
sons for Native disadvantage on intellectual assessment instruments. That is, 

bilingual language orientations and nonverbal childrearing styles, coupled with nonoral- 
visual modality preferences, may reduce the Indian child’s accumulation of standard 
English language content and efficient non-Indian linguistic processing capabilities. An 
emphasis upon social and emotional stimuli and relationships rather than upon transfer 
of abstract information and factual knowledge may be reinforced through traditional cul- 
tural values, low socioeconomic status, and lack of school success. In addition, an ex- 
tremely high incidence of otitis media and consquent hearing loss in Indian groups may 
affect psycholinguistic development, reducing auditory-verbal reception, association, 
sequencing, and expressive abilities. (McShane & Plas, 1984, p. 70) 


A strategy of renorming the WISC-R with Native children may operate to adjust 
overall scores in a manner to compensate for increased test difficulty due to socio- 
economic disadvantage and/or cultural differences, but fails to resolve a possible 


measurement problem and may actually decrease the test’s criterion predictive 
validity. 
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Clearly, a renorming strategy should go hand in hand with studies of test crite- 
rion validity and measurement reliability. Further research is necessary to evaluate 
the WISC-R’s test-retest reliability and predictive validity with Inuit children. At 
present, the use of appropriately renormed standardization tests and a “social 
utilities” approach (Novick & Petersen, 1976; Petersen & Novick, 1976) in selection 
and placement decisions appears to be the most politically and socially palatable 
methodology for ensuring fairness to individuals and social responsiveness to mi- 
nority and/or culturally-different groups (Reschley & Sabers, 1979). 


Notes 


ily Requests for reprints should be addressed to Robert F. Mulcahy at the Department of Educational Psychology, University 
of Alberta, Edmonton, Alberta, Canada, T6G 2G5. The collection of the data upon which the present paper is based was 
funded by the Department of Education, Government of the Northwest Territories. The production of this paper was par- 
tially supported by the Social Sciences and Humanities Research Council of Canada through a Doctoral Fellowship to the 
first author. The authors wish to acknowledge and thank David W. Matheson, Department of Educational Psychology, 
University of Alberta, for his invaluable assistance. 


QD Mulcahy, R., & Watters, B. (1983). Report on Phase 1: Kitikmeot & Keewatin Region Norming Project. Unpublished doc- 
ument. Available from the Department of Education, Government of the Northwest Territories, Yellowknife, N.W.T., 
Canada. 


ou For Picture Completion, items remained unchanged; however, only for the first nonresponse, the examiner said: “Look 
carefully and tell me what is missing.” For Similarities, an item was added at the end of the subtest, (“How are a kilogram 
and a meter alike?”), but was not included in the scoring. Also, when a child responded simply “yes” or “no,” the examiner 
asked: “Tell me how?” Arithmetic item #10 was replaced with: “If one candy bar costs $.08, how much will three candy bars 
cost? ” and item #11 replaced “supermarket” with “the Bay,” (contraction for the Hudson’s Bay Company stores common 
in the Canadian North). For Vocabulary, examiners were told to use up to two queries if, after one query, the child gave only 
part of the answer and had stopped. 


4, Inspection of the tables will provide information on new placement order in terms of difficulty level for the Inuit children. 


5. Without reporting mean “IQ” values for the Inuit children, which are misleading given that the test items show bias for the 
sample, it will be noted that, using the original Wechsler norms, none of the 366 children scored above 119 on Full Scale IQ 
(i.e., 0% in the superior to very superior range) and 32.24% scored below IQ=70, in contrast to Wechsler’s reported 
percentages of 9.7% and 2.2%, respectively, for the normative group. 


6. It will be recalled that all item response frequencies and item difficulty gradients involved a 0-1 scoring system. As noted 
earlier, three of the subtests are unaffected. For the other six, there is some reduction of response variance and lowered 
item-total correlations; however ordinal consistency, a major concern of the paper, is increased. These limitations must be 
considered in interpreting the findings. 
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Data from a national sample of 2,025 13-year-olds participating in the National As- 
sessment in Science in 1981-82 was used to test a model of educational productivity 
involving factors including ability, motivation, quantity and quality of instruction, 
and the environments of the home and class. When a set of 11 individual predictors 
was regressed on achievement and attitude, the educational productivity model was 
generally supported. With predictor variables controlled for one another, student 
achievement was related to ability, motivation, science teaching budget, years of sci- 
ence, amount of homework, class environment, home environment, amount of televi- 
sion viewing (negative association), gender, and race. Student attitudes were found to 
be positively related to ability, motivation, attitude to the teacher, amount of home- 
work and class environment, and negatively to the amount of television viewing. 


The aim of the study was to use data from a large national random sample of 
students in the U.S.A. to investigate the influence on cognitive and affective 
outcomes of a set of productive factors in learning (including ability, motivation, 
quantity and quality of instruction, and the environments of the home and class). 


Background and Theoretical Framework 


According to Walberg’s theory of educational productivity (Walberg, 1981, 
1983, 1984a,b; Horn & Walberg, 1984: Walberg, Tsai, & Harnisch, 1984), nine fac- 
tors require optimization to increase student achievement of cognitive and affective 
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outcomes. These factors are the student aptitudinal variables of: (1) ability or prior 
achievement, (2) age, (3) motivation or self-concept as indicated by personality 
tests or willingness to persevere on learning tasks; the instructional variables of: (4) 
quantity of instruction, (5) the quality of the instructional experiences; and educa- 
tionally stimulating psychological aspects of the: (6) home environment, (7) the 
classroom or school environment, (8) the peer group environment and (9) the mass 
media (especially television). 

Compared with these nine factors, economic, political, and sociological charac- 
teristics are less relevant to learning because their influences are less alterable, di- 
rect, and observable; they are not substitutes for the nine factors, but more distant 
forces which can support or interfere with them. The nine productive factors large- 
ly were identified from syntheses of about 3,000 individual studies of factors re- 
lated to student learning (see Walberg, 1984a,b). Some of these syntheses have in- 
volved relating student outcomes to motivation (Uguroglu & Walberg, 1979), quan- 
tity of instruction (Frederick & Walberg, 1980), home environment (Iverson & 
Walberg, 1982), classroom environment (Haertel, Walberg, & Haertel, 1981), and 
leisure-time television viewing (Williams, Haertel, Haertel, & Walberg, 1982). 


Few prior intensive experiments and quasi-experiments are national in scope, 
and most analyze only one or two of the factors and sample limited populations 
within a school or community. They are often strong on observational technique, 
measurement, verification, and random assignment to treatments (in short, inter- 
nal validity), but they are often weak in generalizability or external validity since 
they do not sample rigorously from large, well-defined populations. Survey research 
has complementary strengths and weaknesses: it often draws large, stratified, ran- 
dom samples of national populations and measures more factors, but sacrifices in- 
ternal validity since the factors are usually measured cross-sectionally and perhaps 
superficially with only a few items. Also survey research can control statistically to 
some extent for multiple causes and can be more causally convincing than quasi- 
experiments controlled only for one or two covariates. 


The complementarity of intensive and extensive studies, however, is important. 
In principle, consistent, powerful effects should emerge consistently from either 
form of research as well as from case studies. Robust findings give a more solid 
basis for educational policy and practice. In fact, approximately 70 syntheses of 
several thousand intensive studies (Walberg, 1984b) support the consistently posi- 
tive influences or correlations of the nine productivity factors or their more specific 
aspects. The expected negative association of achievement with hours of television 
viewing beyond ten per week is the exception (see Williams, et al., 1982). 


In addition, a synthesis of nine regression studies of extensive survey data, 1n- 
volving 15,802 13- and 17-year-old students tested in mathematics, science, social 
studies, and reading by the National Assessment of Educational Achievement, 
supports the sign of these findings (Walberg, 1984b). Although the correlations of 
the factors with achievement and subject-matter interest as learning outcomes 
varied from -0.45 to +0.68, 83 (or 91 percent) of the 91 correlations were in the ex- 
pected direction. Moreover, when the factors were controlled for one another in 
multiple regressions, 58 (or 91 percent) of the 64 coefficients carried the expected 
sign. Although the magnitudes of the correlations depended slightly on the type of 
learning outcome (achievement or attitude), subject matter, and productivity fac- 
tor, they averaged +0.19, and most ranged between +0.05 and +0.40. The stand- 
ardized regression weights, moreover, were nearly all in the expected direction and 
not significantly different across age, subject matter, type of outcome, and produc- 
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tivity factor. The analyses, however, revealed collinearity among the factors; stu- 
dents advantaged on one also tended to be advantaged on the others. 


The results suggest further that no single factor is dominant in determining 
learning outcomes and that a one standard-deviation (SD) increment in any factor 
(with other factors held constant) is associated with a 0.05 SD increment in learn- 
ing from, for example, the 50th to the 52nd percentile, which represents a trivial 
advantage. Raising the complete set of factors one SD, however, is associated with 
a l7-percentile increment from the 50th to the 67th percentile. To take a more ex- 
treme example, an increment of two SDs from the 50th to the 98th percentile on all 
the factors is associated with an increment of 32 percentile points from the 50th to 
the 82nd percentile of learning. 


Prior syntheses of intensive and extensive studies thus suggest that no single 
factor can serve as a panacea. Improving all the factors, using scarce resources of 
human time and effort as efficiently as possible, would seem a more advisable pol- 
icy than improving only one, which may lead to diminishing returns. Most of the 
evidence, however, has been assembled from studies of a decade or more ago, and it 
would be useful to test the generality of the results for more current data. Thus 
the purpose of the present study was to compare regressions of National Assess- 
ment in Science data recently collected in 1981-82 with previous regressions and 
syntheses of smaller-scale experimental and quasi-experimental studies. 


National Assessment in Science 


The National Assessment of Educational Progress (NAEP) was established in 
the U.S.A. in 1969 to assess periodically students’ knowledge of various school 
subject areas. National assessments in science were conducted by NAEP in 1969- 
70, 1972-73, and 1976-77. But, because of legislative decisions and financial con- 
straints, the National Institute of Education postponed the next fully-fledged 
NAEP science assessment until the late 1980s. Many science educators were con- 
cerned that this hiatus would permit emerging problems to go unchecked, and so 
the National Science Foundation funded a national assessment which would fill 
this void. Consequently, during 1981 and 1982, a project under the direction of 
Wayne Welch at the University of Minnesota involved the collection of data on 
many variables including student cognitive and affective outcomes and characteris- 
tics of the home, the community, and the school (Hueftle, Rakow, & Welch, 1983; 
Welch, 1985). This national assessment of educational progress in science involved 
a national random sample of approximately 18,000 students of ages 9, 13, and 17 in 
about 700 schools in the U.S.A. Examples of some of the uses of this data base in- 
clude examining secondary school science enrollments in the United States (Welch, 
Harris, & Anderson, 1984), inequities in opportunities for computer usage in 
schools (Anderson, Welch, & Harris, 1983), and perceptions of secondary school 
students toward women in science (Welch, Rakow, & Harris, 1984). 


Because the main purposes of this and previous NAEP surveys are to assess the 
status of scientific literacy among various groups, and to compare this with images 
assessed in 1976-77, many analyses simply focus on the proportion of respondents 
in different subgroups answering each item correctly. For example, Rakow, Welch, 
and Hueftle (1984) report this type of data and provide the interesting conclusion 
that, whereas 9-year-olds increased a small amount in their science achievement 
over the past five years, 13-year olds showed essentially no change while 17-year- 
olds experienced a decline. But, because of the large number of individual test 
items answered by large numbers of students, the National Assessment in Science 
data provide an extremely useful basis for performing secondary analyses which in- 
terrelate variables (often defined in terms of subsets of individual items). In fact, 
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provision of a data base for science education research was a recognized aim of 
the 1981-82 assessment, and so, the survey included many school and student back- 
ground characteristics. For example, some secondary analyses of the National As- 
sessment in Science data already completed have explored questions such as the 
stability of attitudes to science between junior and senior high school (Hofstein & 
Welch, 1984) and the influence of class attitudes and teacher image on student 
outcomes (Tamir, Welch, & Rakow, in press). 


Operationalizing Measures of Outcome Criteria and 
Productivity Factors 


As the National Assessment in Science data included items which could be in- 
terpreted as measures of most of the factors in Walberg’s educational productivity 
model, secondary analyses were conducted for the purpose of probing the validity 
of the model. The first step involved using the data base to operationalize two 
measures of student outcomes and at least one measure of seven of the nine pro- 
ductive factors in the model. Appendix A provides detailed descriptions and 
operational definitions of each of the variables involved, including scale reliability 
estimates where appropriate. 

A cognitive achievement measure consisted of 49 multiple-choice items covering 
content topics, inquiry skills, and understanding of societal issues (alpha reliabil- 
ity=0.80). An attitudinal outcome measure consisted of 19 Likert-type items as- 
sessing opinions about the value of science and willingness to solve social problems 
(alpha reliability =0.68). Ability was assessed with a self-report item asking about 
students’ previous grades in school. Age was excluded because the sample consisted 
only of 13-year-olds and therefore exhibited very little variability. Motivation was 
assessed by eight items asking about the frequency of voluntary participation in 
science-related activities (alpha reliability=0.78). Quality of instruction was 
assessed by two different variables, namely, the average science teaching budget 
per pupil (as reported by the school principal) and students’ attitudes toward their 
teacher’s enthusiasm, and so forth (assessed by five Likert-type items with an 
alpha reliability of 0.66). The quantity of instruction also was assessed by two vari- 
ables, namely, the number of years of science taken over the previous two years 
and the average number of hours of homework per day. The class environment was 
measured by six items asking students how they felt during science classes (alpha 
reliability =0.68). The home environment was assessed in terms of the higher of the 
ratings for father’s and mother’s education coded on a scale of one to six. The peer 
group environment was not assessed. The mass media environment was assessed in 
terms of the number of hours of television watched during the previous day. In ad- 
dition to the above factors in the productivity model, the present study involved 
the two extra variables of gender and race (white vs. non-white) because they had 
been found to be good predictors of learning among science students in the present 
sample. Of course, should these extra variables of gender and race prove to be sig- 
nificant independent predictors of learning, this would suggest an omission in the 
original productivity model. 

These descriptions of variables show that several of the productive factors are 
incompletely, crudely, or unreliably indexed by the questionnaire items available as 
part of the National Assessment in Science. As this leads to underestimates in the 
magnitude of associations between learning and productive factors, it is likely that 
future studies involving better measures of variables would yield even stronger re- 
lationships than those emerging in this study. 
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Sample 


The National Assessment in Science selected respondents of ages 9, 13, and 17 
using a stratified, two-stage probability sample design to ensure proportional repre- 
sentation by regions, males and females, different racial groups, and size and type 
of community. The first stage of the sampling involved dividing the U.S.A. into pri- 
mary sampling units consisting of geographic groupings with a minimum popula- 
tion size requirement. These primary sampling units were stratified according to 
four regions (Northeastern, Southeastern, Central, and West) and five sizes of com- 
munity strata (ranging from large cities with more than 200,000 population to ex- 
treme rural areas of less than 10,000). The sample was drawn with probabilities 
proportional to size to represent all regions and community sizes. Oversampling of 
low-income and rural areas ensured adequate representation of these groups. In the 
second stage of sampling, 125 schools were randomly chosen with probabilities pro- 
portional to the size of the school. Finally, a random sample of approximately 16 
students was selected from each school. 


Although the total sample consisted of approximately 18,000 students in about 
700 schools throughout the U.S.A., the group of almost 8,000 13-year-olds tested in 
298 schools was selected for consideration for the present secondary analyses. Be- 
cause the study involved a matrix sampling plan whereby different students 
completed different parts of the total test battery, the secondary analyses were re- 
stricted to a subgroup of 2,025 13-year-olds who had completed all items in one test 
booklet which made up the variables in the productivity model. In an attempt to 
ensure uniformity of testing conditions, test administrators visited each school to 
give the instruments using a timed and paced audiotape. 


Analyses and Results 


Table 1 provides some descriptive statistics for each of the outcomes measures 
and productivity factors. This information includes the mean and standard devia- 
tion of each variable and the intercorrelations among variables for the present sam- 
ple of 2,025 students. For the two categorical variables of gender and race (which 
involved binary coding with 1=male, 0=female, 1=white, and 0=non-white), the 
mean simply represents the proportion of males or whites present in the total sam- 
ple. For example, Table 1 shows that the sample was made up of 48% of males and 
79% of whites. 

Table 1 shows that some of the stronger simple correlations with cognitive 
achievement were for ability (0.30), class environment (0.25), and race (0.28). The 
larger correlations between attitudes and productive factors emerged for instruc- 
tion (0.43), attitude to the teacher (0.35), and class environment (0.36). 

Table 2 shows the results obtained when the whole set of 11 predictors was re- 
gressed on achievement and attitude. Use was made of a “backwards elimination” 
method in which variables were progressively dropped from the full regression 
model containing all predictors until all variables remaining had regression weights 
significantly different from zero (p<0.05). The main statistics reported for each 
productive factor in Table 2 are the raw regression weight, b, and the result of a t- 
test of whether the magnitude of the regression weight is greater than zero. Table 2 
also shows that the multiple correlation for the whole of a set of predictor variables 
was 0.52 for achievement (27% of variance accounted for) and 0.51 for attitude 
(26% of variance accounted for). The total amounts of variance are not large and 
suggest the need in future national assessments of better and more sensitive meas- 
ures of many of the educational productivity factors. 
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TABLE 2 


Raw Regression Weights and t Ratios For Each Significant 
Productive Factor For 13-Year-Olds (N= 2,025) 


Productive Factor Achievement Attitude 

b iB b il 
Ability 0.81 9.28** 0.38 a0 
Motivation 1.09 (ae toy P2AO0e* 


Quality of instruction 
Science teaching budget/ 


pupil 0.04 Ye spe 
Attitude to teacher 216 8.37** 
Quantity of instruction 
Years of science 1.66 8.00** 
Hours of homework per day 2.05 WAT he: 4.30 4.04** 
Class environment 1.14 yy Wi feted 1.70 6:07** 
Home environment 0.51 oor 
Hours of TV per day a i7 oD70** =0:21 -2.76** 
Gender 1.56 bes0"* 
Race (white, non-white) 4.12 12.44** 
Multiple R 0.52 0.51 


*p<0.05, **p<0.01 


The number of significant independent predictors for the achievement outcome 
was 10 for the cognitive outcome and six for the affective outcome (see Table 2). 
When controlled for all other variables, ability, motivation, science teaching 
budget, the number of years of science taken over a two-year period, the amount of 
homework, the classroom environment, the home environment, the amount of tele- 
vision viewing (negative direction), gender, and race were all significantly related to 
science achievement. For the attitude outcome, the variables linked with science at- 
titudes, when controlled for other variables, were ability, motivation, attitude to 
the teacher, amount of homework, classroom environment, and amount of televi- 
sion viewing (negative direction). 

Raw regression weights are reported in Table 2 because they indicate the 
change in the number of points on an outcome measure associated with a one-unit 
increment in each independent variable when all other independent variables are 
held constant. Of course, corresponding standardized beta weights could be calcu- 
lated readily using the standard deviations listed in Table 1. For example, Table 2 
shows that, for cognitive achievement and with other factors held constant, an in- 
crease of one hour per day in the time spent on homework is associated with 2.05 
points increase, while a decrease of one hour of television viewing per day is associ- 
ated with an increase of 0.17 of a point. Also for the binary variables coded 0 and 1, 
regression weights can be interpreted directly as group differences. For instance, on 
cognitive achievement, boys scored 1.56 points higher than girls and whites scored 
4.12 points higher than non-whites when the other productive factors were held 
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constant. That is, with other factors fixed, white boys scored 5.68 (about five-sixths 
of a standard deviation) higher than non-white girls on the achievement test. 


It is interesting to note that several relatively unalterable factors—specifically, 
ability, home environment, gender, and race—are significant predictors of student 
cognitive achievement. It is also somewhat disappointing to find that the two meas- 
ures of quality of instruction—namely, the science teaching budget per pupil (mar- 
ginally significant) and students’ attitudes to their teacher (nonsignificant)—are 
relatively weak predictors of achievement. Nevertheless, six relatively alterable fac- 
tors were still found to be significant independent predictors of science students’ 
cognitive achievement; these are motivation, science teaching budget, the number 
of years of science, the amount of time devoted to homework, the class environ- 
ment, and the amount of television viewing (negative relationship). 

Five of the factors found to be significant independent predictors of cognitive 
achievement also were found to be significant independent predictors of student 
attitudes; these are ability, motivation, amount of homework, class environment, 
and amount of television viewing. But, additionally, attitude to the teacher proved 
to be a significant independent predictor of students’ attitudes. On the other hand, 
five factors important in predicting cognitive achievement—namely, science teach- 
ing budget, years of science, home environment, gender, and race—all turned out to 
be unimportant in predicting affective outcomes when other factors are controlled. 
Still, there are a number of alterable factors which teachers can concentrate on if 
they wish to attempt to improve students’ attitudes to science. 


The positive influence of the home environment on cognitive achievement is 
consistent with syntheses of prior research studies (Graue, Weinstein, & 
Walberg, 1983; Iverson & Walberg, 1982). Similarly, the negative association be- 
tween achievement and amount of television viewing is consistent with Williams, 
Haertel, Haertel, and Walberg’s (1982) research synthesis which revealed a delete- 
rious effect beyond 10 hours of viewing per week. The finding of links between 
achievement and the two measures of quantity of instruction in the present study 
(namely, number of years of science and amount of homework) is consistent with 
Frederick’s (1980) review of research on instructional time. The magnitudes of the 
correlations emerging in the present study, however, are considerably smaller than 
the correlation of 0.73 found between mathematics achievement and the number of 
semesters of mathematics instruction in Welch, Anderson, and Harris’ (1982) an- 
alysis of the 1977-78 NAEP data of 17-year-olds. These differences in the sizes of 
the correlation between achievement and quantity of instruction for science and 
mathematics might be explained partly by the fact that there are more 
opportunities for out-of-school learning in science (e.g., museums, zoos, television) 
than in mathematics. 


It is noteworthy that the nature of the classroom psychosocial environment 
emerged as a significant predictor of both science achievement and attitudes to sci- 
ence when other factors were held constant. The simple outcome-class environment 
correlations of 0.25 for achievement and 0.36 for attitudes to science are among the 
larger correlations found for any of the productivity factors in Table 1. These find- 
ings replicate individual studies of the effects of classroom environment on student 
achievement (Fraser & Fisher, 1982) and attitudes (Fraser, 1981) and are 
consistent with Haertel, Walberg, and Haertel’s (1981) meta-analysis. 


Conclusion and Discussion 


The results show that a number of factors, previously revealed to be consistent 
correlates of achievement in syntheses of small-scale bivariate research, were found 
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also to be significant predictors of student outcomes when mutually controlled for 
each other in a large national survey. This suggests that national achievement and 
attitudes are jointly influenced by a number of factors rather than by a single dom- 
inant one. Moreover, the results of these secondary analyses generally support the 
validity of the model of educational productivity (although two factors not in- 
cluded in the model, namely, gender and race, proved to be significant independ- 
ent predictors of cognitive achievement). 


The present research has identified four productivity factors which predict cog- 
nitive achievement and which are relatively unalterable by the school; these are 
ability, home environment, gender, and race. Of these four variables, ability is the 
only factor also associated with student attitude. Nevertheless, there are five rela- 
tively school-alterable factors (namely, motivation, quality of instruction, quantity 
of instruction, class environment, and amount of television viewing) which were sig- 
nificant independent predictors of both achievement and attitude. It is encouraging 
to find that there are numerous school-alterable factors which teachers can work on 
if they wish to improve the cognitive and affective performance of their junior high 
school students. 


This study highlights the problem arising from the mismatch between the 
purposes of the primary and secondary analyst. The national assessment data con- 
tained only crude and insensitive measures of several of the productivity factors. 
This is not surprising, of course, since the original survey was not designed for the 
secondary analyst’s purpose of testing the productivity model. Because of the gen- 
erality of the factors in the productivity model and the model’s potential for im- 
proving learning, it would be highly desirable for future surveys of the National As- 
sessment of Educational Progress to incorporate high-quality measures of the pro- 
ductivity factors. 

Despite some acknowledged shortcomings, this paper still attests to the poten- 
tial usefulness of secondary analyses of data from large-scale assessments (Fraser & 
Tobin, 1985). Clearly, there were important advantages in using the National As- 
sessment in Science data base in the present study when compared with collecting 
primary data to investigate the same questions. These include the reduction in re- 
sponse burden on students and schools, the low cost of data, the quality of the 
large, national, random sample, the generalizability of results, and the possibility of 
future replications. Although a potential problem commonly facing secondary ana- 
lysts is difficulty in understanding and using the data because of its complexity or 
inadequate documentation, fortunately, this was overcome with the National As- 
sessment in Science data because of thorough documentation, careful sampling and 
test administration, and the use of a variety of quality checks at every stage of as- 
sessment and analysis. 

Given the limitations in the quality of some of the measures of the productivity 
factors, more sophisticated analyses than those reported here would not be 
justified. Nevertheless, in future tests of the productivity model using more sensi- 
tive measures, benefit is likely to be obtained by using more sophisticated path an- 
alysis models, such as LISREL (Joreskog & Sorbom, 1984), in which some of the 
productive factors can be treated as endogenous interviewing variables. As well, 
analyses could try to establish directional links between achievement, attitude, and 
productivity factors. This use is illustrated by Riley and Schibeci (1983), who used 
LISREL with the 1976-77 NAEP science data to show that attitude influenced 
achievement, while background factors similar to the productivity factors affected 
both achievement and attitude. 
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Future secondary analyses of data bases containing more sensitive measures 
also might usefully examine non-linear relationships between productivity factors 
and outcomes. Walberg (1981) has noted that there are likely to be “diminishing re- 
turns” associated with raising several of these factors (e.g., quantity of instruction). 
Amount of homework provides a good example of a variable which is likely to bear 
a non-linear relationship with achievement. Similarly, Williams, Haertel, Haertel, 
and Walberg’s (1982) meta-analysis has shown that the negative relationship be- 
tween television viewing and achievement was only evident in the range between 
two and eight hours (so that a deleterious effect was not present under two hours 
per day and over eight hours per day). 

There remain numerous important research questions which cannot be explored 
using currently available data bases of the National Assessment of Educational 
Progress. Consequently, a desirable thrust for national assessments in the future 
would be the inclusion of measures and variables which would allow researchers to 
investigate the reasons why productivity factors promote student achievement and 
attitude. For example, what is the mechanism by which home environment, gender, 
and race influence student achievement? Clearly these variables are proxies for fac- 
tors in the home, the school or elsewhere which are the real causes of differential 
performance. If the data from national surveys could be utilized to pursue these 
important questions, it is likely that researchers would uncover information to 
guide our attempts to reduce the large differences in achievement evident among 
students differing on such dimensions as home background, gender, and race. 


The findings emerging from the present analyses refute several claims made in 
relation to school learning and assumptions made in educational research. First, al- 
though much prior research has been bivariate in nature, this study clearly illus- 
trates that no single factor alone can produce marked increases in learning. 
Improving all the productive factors using scarce resources, including time and ef- 
fort, as efficiently as possible would seem a more advisable policy than improving 
only one. Second, the results are at variance with another commonly held view that 
school learning is too complex to be understood in terms of a relatively small num- 
ber of underlying factors. The third notion dispelled by the present findings is that 
the only important factors in predicting student outcomes are those that cannot be 
altered by teachers or the school. 
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Appendix A 


Descriptions and Operational Definitions of Variables 


Achievement Outcome 


Forty-nine multiple-choice items covering science content topics (e.g., energy, 
life, changing aspects of the earth), inquiry skills (including experimentation, data 
interpretation, measurement, problem definition and solution), understanding of 
the role of theories in science and an appreciation and understanding of societal is- 
sues (including health, resource management, nutrition, safety, and the costs and 
benefits associated with applied science and technology). Maximum score=49. 
Alpha reliability =0.80. 


Attitude Outcome 


Nineteen Likert-type items assessing students’ opinions about the usefulness 
and value of science and willingness to solve social problems (e.g., by turning off 
lights when they are no longer needed). Alpha reliability=0.68. Each item is re- 
sponded to on a five-point scale. Items are coded as: 

5 = Always or Strongly agree or Definitely yes 
4 = Often or Agree or Probably yes 
3 = Sometimes or No opinion or No response 
= Seldom or Disagree or Probably not 
1 = Never or Strongly disagree or Definitely not 
Some items are scored in the opposite direction. Maximum score=95. 


Ability 
A self-report item asking students to describe grades in school so far coded as: 
8 = Mostly A . 
7 = About half A and half B 
6 = Mostly B 
5 = About half B and half C 
4 = Mostly C 
3 = About half C and half D 
2 = Mostly D 


1 = Mostly below D 


Motivation 


The mean of eight items asking how often science-related activities (e.g., read- 
ing science articles in magazines, watching science shows on TV, going to hear 
people give talks on science) have been done when not required for science classes. 
Alpha reliability =0.78. Coded as: 

5 = Often 

4 = Sometimes 


3 = No opinion or No response 
2 = Seldom 
1 = Never 
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Science Teaching Budget 


Science teaching budget (in dollars) per student obtained from a principal’s 
questionnaire reporting the school’s total instructional budget, the percentage de- 
voted to science and the number of students in the school. 


Attitude to Teacher 


The means of five Likert-type items requesting students’ opinion about their 
present or most recent science teacher. The items ask whether the teacher “really 


likes science,” “wants students to point out mistakes. . .,” “makes science exciting,” 
“is enthusiastic,” and “is willing to share opinions. . ..” Alpha reliability=0.66. 
Coded as: 

5 = Strongly agree 

4 = Agree 

3 = No opinion 

2 = Disagree 


1 = Strongly disagree 


Quantity of Science Instruction 


Based on two items asking students if they are taking a science course currently 
and whether they were taking one at the same time during the previous year. 
Coded as: 

0 = taking a science course in neither year 
1 = taking a science course in one of the years 
2 = taking a science course in both years 


Homework 


An item asking students to indicate the average amount of time per day spent 
on homework (all school subjects). Coded as: 
0 = None 
1 = Less than one hour per day 
2 = Between one and two hours per day 
3 = More than two hours per day 


Class Environment 

The mean of six items asking students if science classes made them feel “un- 
comfortable” (reverse scoring), “curious,” “stupid” (reverse scoring), “confident,” 
“successful” and “unhappy” (reverse scoring). Alpha reliability=0.68. Except for 
items with reverse scoring, coding is: 


5 = Always 

4 = Often 

3 = Sometimes 
2 = Seldom 

1 = Never 


Home Environment 

The higher of the ratings for father’s and mother’s education coded as: 
1 = Did not complete eighth grade 
2 = Completed eighth grade, but did not go to high school 
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3 = Went to high school, but did not graduate from high school 
4 = Graduated from high school 

5 = Some education after graduation from high school 

6 = Graduated from college 


Ve 


An item asking students how many hours of TV they watched during the previ- 
ous day. Coded as: 
1 = Watched one hour or less 
2 = two hours 
3 = three hours 
4 = four hours 
5 = five hours 
6 = six hours or more 


Gender 


Coded as: 
1 = Male 
0 = Female 


Race 
Racial background with blacks, native Americans, Hispanics, and Asians in- 
cluded in the non-white category. Coded as: 
1 = White 
0 = Non-white 
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Student Orientations to School Rules! 


This paper uses data from a survey questionnaire administered to approximately 
8,000 high school students in 55 randomly selected schools in Newfoundland and Lab- 
rador to identify the social background, organizational, and student characteristics 
which are associated with student orientations to school rules. Analyses of students’ 
perceptions of number of rules, necessity of rules, necessity of following rules, 
impossiblity of obeying rules, frequencies of breaking school rules, whether it is right 
or wrong to break school rules, and frequencies of individual and group approaches in 
teacher-student interactions concerning the implementation of rules, indicate that 
while certain variables in each of the three categories (social background, organiza- 
tional and student characteristics) are more important than others, student charac- 
teristics are more often related to students’ perceptions of different dimensions of 
school rules than are social background and organizational characteristics. 


Rules, as a means of legitimizing rational-legal authority, are the core of the 
bureaucratic phenomenon in contemporary society. In line with Weber’s (1964, 
pp. 333-334) ideal type bureaucratic model, a system of rules covering the rights 
and obligations of its members is generally seen as an important characteristic of 
the school organizational entity. By way of setting the stage for the present focus 
on factors relating to student orientations to rules, it is appropriate to give a brief 
overview of particular organizational dimensions of the school. 


Organizational Dimensions of the School 


Two interrelated topics of long-standing interest in the organization of the 
school have particular relevance to the present research. One is the issue of student 
alienation and the other is student involvement in decision-making in the school. 


Dr. Macdonell is a Professor in the Department of Sociology, University of New Brunswick with interests in the sociology of edu- 


cation with particular reference to higher education. ’ the 
Dr. Martin is a Professor in the Department of Educational Foundations, Memorial University of Newfoundland, with interests 


in the sociology of education with particular reference to micro level processes in the school. 
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Using the concepts of “status maintenance” and “behavior control” to describe stu- 
dents’ perceptions of six dimensions of bureaucratic structure, Anderson (1970) 
found that students’ alienation is negatively related to their perceived status main- 
tenance. The implication is that student alienation may increase as bureaucratic ef- 
ficiency increases. 

Several Canadian studies have focused on student involvement in the decision- 
making processes in the school (Treslan, 1969; Wood, 1977; Alexander & 
Farrell, 1975; Hoffman, 1971; and Probert, 1971). These studies show a lack of 
student involvement in the decision-making processes in the school and a concomi- 
tant student desire for greater participation in this dimension of school life. The 
question of student orientation to school rules has been alluded to in these studies 
of students’ perceptions of their involvement in decision-making in the school. For 
example, in their study of students in Ontario, Alexander and Farrell (1975) found 
that students think they have a minimal influence in most areas of decision-making 
in the school. An Alberta study by Treslan (1969) found that, with few exceptions, 
teachers and administrators are perceived to be making all decisions concerning 
rules and regulations in the school. 


Even though it has been argued that the nature of teaching is such that “rule 
domination is much less compatible with the performance of school activities than 
it is with .. . the management of industrial production of tangible goods and serv- 
ices” (Dreeben, 1970, p. 47), rules are one of the dominating features of many high 
schools. There is usually a host of rules which attempt to govern student behavior 
from the moment they arrive on the school grounds, to their entering the school 
and their moving around in the corridors, to their visits to the washroom and other 
specifically defined areas in the school, including directions for a multitude of 
classroom activities and potentially problematic situations. In fact, schools often 
develop extensive lists of rules, and put considerable effort into communicating 
them through student handbooks, posting lists on bulletin boards and on classroom 
walls, and by repeating them during the teaching-learning processes. 


Given the widespread use of bureaucratic rules in the school in general, and in 
teacher-student interaction in particular, it is in many ways surprising that re- 
search on different aspects of this phenomenon is so sparse. A host of rule-related 
issues has yet to be studied and more fully understood in the quest to get at the 
complexities of the interrelationships among such things as student attitudes and 
behaviors, organizational features of the school and societal structures and pro- 
cesses. In attempting to understand these complexities, it is seen that one of the 
most salient features of the formal organization of the school is the rules, particu- 
larly those prescribing the rights and obligations of the students. The present re- 
search focuses on student orientations to school rules. More specifically, the intent 
is to see what social background characteristics of students, organizational charac- 
teristics of schools and student characteristics are related to and associated with 
student orientations to school rules. 


Data Collection and Analysis 


Data on which this publication is based were collected as part of a larger study 
of the schooling experience of high school students in Newfoundland and Labrador. 
The larger study used three data collection techniques: semi-case study, taped 
interviews, and a survey questionnaire. The questionnaire was administered to 
7,848 high school students from 55 schools representing 24 school boards in New- 
foundland and Labrador. Actually, 58 schools were selected by means of a random 
sample and stratified on the basis of community size. But the questionnaire was 
not administered in two of these schools because the school boards having jurisdic- 
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tion over them did not grant the researcher permission to approach them concern- 
ing this study. A third school in the sample had to be omitted because its principal 
refused to let the questionnaire be administered in his school. The sample of al- 
most 8,000 high school students is evenly divided along gender lines and propor- 
tional representation from each high school grade was attained. 


Data collected on parts of three questions on the survey questionnaire form the 
basis of the present analysis of student orientation to school rules. One of these 
questions asked students to indicate how often (never, seldom, sometimes, often, 
very often) they did certain things in the school building: 


- Break certain rules which are given concerning classroom behavior; 


- Break certain rules which are given concerning behavior outside of the classroom, but 
within the school or on the school grounds. 


Another question attempted to find out how often students, individually and/or 
in groups, approached their teachers as they attempted to get them to change their 
minds concerning a variety of issues in the school, including: 

- The following rules (for example, dress code, hair styles, smoking, where to walk in the 
school corridors, what to do on the school grounds, behavior while in the classroom and 
behavior while in other areas of the school). 


A third question asked students to indicate whether they agreed or disagreed 
with each of a list of 50 statements. The responses received on six of these state- 
ments form part of the data base for the present analysis. These six statements are: 

- Itis necessary to follow all the school rules all of the time. 

- There is nothing wrong with breaking certain school rules occasionally. 
- Itis impossible to obey all the rules in this school. 

- There are too many rules in this school. 

- Most school rules are necessary. 

- Itis okay to break the rules at school so long as you do not get caught. 


These questions provide data on students’ perceptions concerning eight dimen- 
sions of rules. One is their perceptions of the number of rules in the school. An- 
other is their perceptions of the necessity of rules. Thirdly, there is the extent of 
the student-perceived necessity to follow all school rules. Fourthly, the extent to 
which students perceive it is impossible to obey all the school rules is of interest 
here. Other areas of concern are students’ perceptions of the frequencies with 
which they break classroom rules and the frequencies with which they break other 
school rules. A seventh dimension of rules investigated here revolves around wheth- 
er it is right or wrong to break school rules, and under what circumstances it is per- 
ceived to be right or wrong to follow particular courses of action with respect to 
obeying rules. Finally, attention is given to the frequencies with which students 
perceive themselves to be using specific interaction approaches in attempting to get 
their teachers to change their minds concerning the enforcement of rules. Taken 
together these perceptions are referred to in the present paper as student 
orientations to school rules. 

Chi square test of significance is used to isolate the relationships between 
specific social background, organizational, and student characteristics, on the one 
hand, and student orientations to rules, on the other. The social background, or- 
ganizational, and student characteristics are further divided into categories of inde- 


pendent variables, as follows: 
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I. Social Background 


1. Family size 
- number of children in family 
2. Parental employment and occupation 
- whether or not father is employed 
- whether or not mother is employed 
- father’s occupation 
- mother’s occupation 
3. Parental education 
- secondary education (grade completed) 
- post-secondary education (attended university and/or trades college) 
4, Size of home community 


II. Organizational Characteristics 

1. Type and size of school 

- grades in school (high school grades, junior high and high school grades, 
all grade school) 

- number of classes in high school grades 
- number of students in high school 

2. School organizations 
- student representative council 
- a system of prefects 
- school clubs (organizations) 
- average size of school clubs 

3. School board 
- denominational schools (Integrated, Roman Catholic and Pentecostal) 


III. Student Characteristics 

1. Individual characteristics 
- gender 
- age 
- ethnicity 

2. Organizational membership 
- membership in student representative council 
- being a prefect 
- number of school clubs one has membership in 

3. Evaluation indicators 
- grade level 
- whether or not one failed a grade 
- number of grades failed 
- number of grades repeated 
- whether or not one failed a subject 
- number of subjects failed repeated 

4. Busing and mobility factors 
- whether or not one is bused to school 
- whether or not one has lived in the same community all one’s life 
- numbers of years one has lived in the present community of residence 
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The associations between these orientations to rules, taken as dependent vari- 
ables, and the specific social background, organizational and individual student 
characteristics as outlined above, taken as independent variables, are measured by 
using gamma. This test measures the relative reduction in errors of predicting 
order on the dependent variable. That is to say, the numerical value of gamma, dis- 
regarding whether it is positive or negative, gives the percentage of guessing error 
eliminated by using knowledge of an independent variable to predict order on a de- 
pendent one (Mueller, Schuessler, and Costner, 1977, pp. 187-188; 217-219). 


Findings 
In line with the categories of independent variables included in the present an- 
alysis, the findings are presented by giving separate attention to social background 
variables, organizational characteristics, and student characteristics. 


Social Background 


Summaries of chi squares and gammas for the relationships and associations 
between different dimensions of student orientations to school rules and each of 
the social background variables considered in the present study are given in 
Table 1. In connection with employment and occupation, it is interesting to note 
that none of the dimensions of student orientations is significantly related to 
whether or not father is employed, and only one of three dimensions is significantly 
related to whether or not mother is employed. The associations between these di- 
mensions of student orientations to school rules and father/mother employment are 
also low. The highest gamma is only .09. Six of the dimensions of student 
orientations to school rules are significantly related to father’s occupation and four 
are significantly related to mother’s occupation when occupations are categorized 
along the Blishen and McRoberts (1976) scale of social classes. Again, the associa- 
tions are generally low, with only two of a total of twenty gammas above .10 
(Table 1). 

None of the dimensions of student orientations to rules is significantly related 
to the father’s schooling or to the mother’s schooling, or to the question of whether 
or not the mother went to trades school, or to whether or not the mother went to 
university. Only one of these dimensions is significantly related to whether or not 
the father went to trades school, and only two of the ten orientations are related to 
whether or not father went to university. The associations between student 
orientations to school rules and education are low, with only three of the 
60 associations given in the education grouping of Table 1 being higher than gam- 
ma of .10. 

Of the four categories of social background variables (family size, employment/ 
occupation, education, and community size), community size seems to be most im- 
portant in terms of number of orientations significantly related to population of 
community, and in terms of the strength of associations between these two vari- 
ables. Seven of the ten dimensions of student orientations are significantly related 
to population of community, and four of the ten associations between these two 
variables range from gamma of .10 to gamma of .14. Even though these gamma are 
somewhat low, they are high in comparison to the associations between student 
orientations to school rules and the other social background variables considered in 


this analysis (Table 1). 
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Organizational Characteristics 

Table 2 contains summaries of the chi squares and gammas for the relation- 
ships and associations between student orientations to school rules and the vari- 
ables in each of the three groupings of organizational characteristics; type and size 
of school, school organizations, and school board. 


Student responses to five of the statements indicating their orientations to 
school rules are significantly related to type of school. The associations between the 
responses to some of these statements and type of school are noteworthy with 
gammas of .13 and .17. Both measures of school size (number of classes at grade 
level and number of students in high school grades) are important variables in that, 
except for one dimension, student orientations are significantly related to each of 
these variables. Even though a couple of the associations between these 
orientations and school size variables are almost nonexistent, four of the associa- 
tions have gammas at .13 and .14, and other associations range from .07 to .09 
(Table 2). 

A survey of the four school organizations reveals that only two of the student 
orientation variables are significantly related to the question of whether or not 
there is a student representative council in the school, four of the dimensions of 
student orientations are related to the existence of a system of prefects, three of 
the student orientations are related to school clubs, and five student orientations 
are related to average size of school clubs. A couple of the associations between 
student orientations and the school organizations are noteworthy: that between ne- 
cessity of following rules and school clubs—gamma .20, and between perceived 
number of rules and average size of school clubs—gamma .18 (Table 2). 

Seven of the dimensions of student orientations to rules are significantly related 
to school board, and the associations between three of these orientations and school 
board are relatively high with gamma of .35 between group approach and school 
board, gamma of .29 between individual approach and school board, and gamma 
of .20 between perceived number of rules and school board (Table 2). That is to 
say, of the organizational variables considered in this analysis, school board is the 
most important one in terms of predicting student responses to statements 
indicating their orientations to school rules. According to chi square test, both 
school board and school size are the most important variables in the organizational 
category. The finding concerning the connection between school size and student 
orientations to school rules can be seen to be important in the continuing debate 
over the influence of school size on different dimensions of school life (see, e.g., 
Riffel, 1971; Sackney, 1976; Hickcox & Burston, 1973; and Holdaway & 
Blowers, 1971). Also, the present finding concerning school boards and student 
orientations to school rules is suggestive of the need to look at the school board as a 
unit of analysis in attempting to understand the schooling experiences of high 
school students. Others, for example, Bidwell (1974/1975) and Cooley (1983), have 
pointed to the need to focus on school district in analysing different features of 
schooling. 


Student Characteristics 


By summarizing the chi squares and gammas between student orientations to 
school rules and four groupings of student characteristics (individual characteris- 
tics, organizational membership, evaluation indicators, and busing and mobility 
factors), the importance of certain independent variables are highlighted while 
others are seen not to be significantly related to or associated with these 
orientations. Nine of the ten dimensions of student orientations focused on in this 
research are significantly related to gender. In addition, the association between 
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each of these nine, taken separately, and gender are, with only one exception, rela- 
tively strong indeed. For example, there is a gamma of .47 between breaking other 
school rules and gender, and there are other gammas of more than .40 between 
breaking classroom rules and gender, and between breaking rules and getting 
caught and gender. Gammas ranging from .30 to .40 are found in three other di- 
mensions of student orientations to school rules and gender. The strong associa- 
tions between gender and different dimensions of student orientations to school 
rules may be an indication that the sex-role stereotyping which was demonstrated 
to be a part of schooling experiences a decade ago (Lorimer & Long, 1979/1980; 
Pyke & Ricks, 1975; and Pyke, 1975) is still a part of, at least, the hidden curricu- 
lum of the school. 


While student responses to six of the statements addressing their orientations 
to school rules are significantly related to age, the associations between these 
responses and age are low with gammas ranging from .05 to .09. Only one of the di- 
mensions of student orientations to school rules (group approach to interaction) is 
significantly related to ethnicity, and the gamma of association between these two 
variables is only low at .08. However, the associations between two other dimen- 
sions of student orientations (necessity of rules and necessity of following rules) 
and ethnicity are considerably higher with gammas of .18 and .14, even though 
these orientations are not significantly related to ethnicity (Table 3). Given the im- 
portance of ethnicity in a wide range of schooling issues such as the dropout prob- 
lem and academic achievement in Canadian society (Henry, 1973; Nagler, 1975), 
the present finding suggesting a low relationship between ethnicity and student 
orientation to school rules is interesting indeed. It challenges us to the all too often 
taken for granted influence of particular social demographic features on schooling. 


Turning to the organizational membership grouping of student characteristics, 
it is noted that only one of the dimensions of student orientations to school rules is 
significantly related to membership in student representative council, and the asso- 
ciation between these two variables is moderately strong—gamma .16. Whether or 
not one is a prefect is an important variable in studying student orientations to 
school rules in that their responses to eight of the ten aspects of rules are signifi- 
cantly related to it. Many of the associations between these orientations and this 
independent variable are also relatively strong. Two of the gammas are over .30 
(necessity of rules, breaking rules and getting caught), and three are between .20 
and .30 (necessity of following rules, breaking classroom rules and breaking other 
school rules). Another three gammas are at .19. It has also been found that nine di- 
mensions of student orientations to school rules are significantly related to the 
number of school clubs which students are members of, and the associations be- 
tween each of these dimensions as dependent variables and the number of school 
clubs as independent variables are generally moderately strong. With a couple of 
exceptions, gammas of .02 and .06, the gammas range from .11 to .23 (Table 3). 


Also, in attempting to understand student orientations to school rules, all of the 
evaluation indicators (grade level, whether or not one failed a grade, number of 
grades failed, number of grades repeated, whether or not one failed a subject, num- 
ber of subjects failed, and number of failed subjects repeated) have been found to 
be important in that student orientations to school rules are, with few exceptions, 
significantly related to each of these indicators. While there are some very weak, al- 
most non-existing, associations between certain dimensions of student orientations 
and some of these evaluation indicators for example, gammas of .01, .02 and .03 be- 
tween certain orientations and grade level, overall the gammas between these 
orientations and each of the evaluation indicators are moderate to high, many of 
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which are in the .20 to the .30 range (Table 3). 

Only four of the dimensions of student orientations to school rules are signifi- 
cantly related to busing, and the associations between these four and busing are 
relatively weak with gammas of .03 to .12. None of the student orientations to 
schools rules is significantly related to whether or not students lived in the same 
community all of their lives or to the number of years one has lived in one’s present 
community of residence (Table 3). ; 


Implications 


This study points to the need to be cognizant of theoretical issues and practical 
concerns in the society-school arenas as one strives to gain greater understanding of 
the schooling process in general and of the rule processes in particular. On a theo- 
retical level, there is a need to distinguish between objective and subjective dimen- 
sions of the school organization. In the present study, attention is given to the 
subjective dimension, that is, the student orientations to school rules. The findings 
suggest that student orientations are significantly related to (X? test), and associ- 
ated with (gamma), student characteristics of gender, membership in student rep- 
resentative councils, being a prefect and all the evaluation indicators included in 
this analysis. Hence, there is a need for theoretical development to find ways of 
explaining the saliency of these variables over the social background variables such 
as parental occupation and parental education which have been used traditionally 
to explain different aspects of the schooling experiences of high school students. In 
fact, the overall relationship between social background variables, on the one hand, 
and student orientations to school rules, on the other, is worthy of further study. It 
may be that the issue of the interrelationships among the processes of class, cul- 
ture, and education are more complex than is often realized. The connections be- 
tween these processes are, perhaps, only glossed over when the major focus is on 
parental education and occupation. 

Turning to the practical implications of the present focus on student 
orientations to school rules, the suggestion here is that identifying the variables as- 
sociated with these orientations is important on two accounts. One is the fact that 
having a greater awareness of the differences in student orientations along certain 
variables such as those included in the social background, organizational and 
student categories leads to a greater understanding of the human dimensions of 
rules. In other words, the factors which play important roles in students’ interpre- 
tations of school rules and of the meanings they give to these rules are highlighted. 
These interpretations and meanings are core elements of the social structure in the 
day-to-day operation of the school. Another practical implication of the present ap- 
proach to rules, and the findings presented herein, is the highlighting of the impor- 
tance of certain parts of the school organization. For example, the finding that 
student orientations are significantly related to whether or not one is a prefect and 
to the number of clubs one has membership in may have implications for the devel- 
opment of these aspects of the school organization. Similarly, the significance of 
evaluation indicators in student orientations to rules reaffirms the importance of 
these parts of school life on student orientations to school rules, which in turn are 
intertwined with disciplinary and other teaching-learning processes in the school. 


Note 


ilf Appreciation is hereby expressed to the Social Sciences and Humanities Research Council of Canada. Without the finan- 
cial support of the Council the research on which this paper is based would not have been possible. 
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Benefits and Disappointments from Participation 
in the Alberta High School Equivalency Program’ 


The Alberta High School Equivalency Program provides adults living in Alberta with 
an opportunity of acquiring a high school diploma, either through successfully pass- 
ing the Tests of General Educational Development (GED) or through the assessment 
of a variety of educationally relevant personal factors. Samples of successful GED ap- 
plicants, unsuccessful GED applicants, and people who received the diploma through 
the alternate route were surveyed to determine their perceptions of the benefits re- 
sulting from the program and their demographic characteristics. The results suggest 
that personal benefits such as self-satisfaction outweighed educational and job-re- 
lated benefits. There was some evidence to suggest that those who had acquired an 
equivalency diploma may have enhanced their chances for employment. Of the demo- 
graphic variables considered, only age was found to be important in explaining the 
survey results. 


It is estimated that between forty and fifty percent of all Canadians who are 16 
years of age or older (excluding those still in school) do not possess a high school 
diploma (Statistics Canada, 1980). In a society in which credentials play a critical 
role in selection of jobs and postsecondary education, many of these people feel 
they are at a disadvantage when they seek employment or further education. To as- 
sist such people, most provinces have set up programs which provide opportunities 
for obtaining a high school diploma without the requirement of additional high 
school attendance. In Alberta, the program is called the Alberta High School 
Equivalency Program (AHSEP). 


Dr. Carbol is the Director of the Examinations Branch in the British Columbia Ministry of Education with interests in the areas 
of student assessments and certification. 

Dr. Maguire is a Professor of Educational Psychology at the University of Alberta with interests in achievement measurement 
and applied statistics. 
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The AHSEP was introduced in June 198] as a joint venture by the two depart- 
ments responsible for education within the Province of Alberta, Alberta Education 
and Alberta Advanced Education. Alberta Education is primarily responsible for 
the delivery and monitoring of programs in the areas of early childhood, elemen- 
tary, and secondary education while Alberta Advanced Education’s primary re- 
sponsibilities are in postsecondary education. At the time of its introduction the 
AHSEP was seen by its sponsors as a way of providing individuals, who for one 
reason or another have not completed their formal high school education, with an 
alternative way of acquiring a high school diploma. 

The program provides applicants with two ways of gaining a diploma: the as- 
sessment of a variety of educationally relevant personal factors, or the writing of 
the Tests of General Educational Development (GED). The personal factors are 
the amount of formal high school instruction received by the applicant, the 
applicant’s age, adult education courses taken, and the extent of travel, reading, 
and private study done. The GED Test Program, which was developed by the 
American Council on Education, consists of a battery of five multiple-choice tests: 
Writing Skills, Social Studies, Natural Sciences, Reading Skills, and Mathematics. 
The tests are administered by provincial departments of education in eight Cana- 
dian provinces and the Canadian territories. State educational authorities adminis- 
ter the tests in all 50 American states. In Alberta, the GED tests are administered 
four times each year at 27 locations throughout the province. A passing standard, 
based on a norming study carried out with Canadian high school seniors in 1980, 
has been established. Alberta Education granted over 5000 equivalency diplomas 
through the Alberta High School Equivalency Program between December 1981 
and December 1983. Of these, approximately 85% were through the GED route. 


The present study was undertaken to assess the validity of the claim that the 
AHSEP helps program participants in meeting personal, further-educational, and 
job-related goals. The primary purpose of the study was to survey samples of par- 
ticipants to determine their perceptions of the benefits resulting from participation 
in the program. A secondary purpose was to determine similarities and differences 
among selected sub-groups of those sampled in an effort to explain the sample 
survey results. Information related to the Alberta equivalency population was 
thought to be useful to Canadian adult educators and provincial education author- 
ities responsible for equivalency programs. To date, public policy concerning high 
school equivalency programs has been influenced minimally by Canadian studies 
related to equivalency populations. 


Related Literature 


The empirical literature related to the outcomes of high school equivalency pro- 
grams is sparse and most of it is American. Variables of particular interest have 
been employer and educational institution acceptance of equivalency program 
graduates, employment and educational benefits derived from program participa- 
tion, and a variety of personal and demographic characteristics of program partici- 
pants such as sex, age, previous highest level of education, reasons for taking the 
GED tests, and others. 

Equivalency program acceptance, especially as it is related to successful com- 
pletion of the GED tests, is well documented. The United States armed forces gen- 
erally accept high school equivalency diplomas as meeting one of their entrance cri- 
teria (Laurence, 1983; Waller, 1976). Similarly, the American Council on Education 
(1982) reported that about 88% of American colleges and universities admit appli- 
cants with GED credentials. 
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Beusse (1974) noted that the receipt of a high school equivalency diploma 
through the GED Test Program offered by U.S.A. armed forces led to higher pay 
and more prestigious jobs in civilian life. Similar employment benefits were re- 
ported by Ayers (1980) for graduates of an American community college, high 
school equivalency program, by Fischer and Evanson (1979) for Adult Basic Educa- 
tion graduates, by Malizio and Whitney (1985) in a survey of 1500 American com- 
panies, and by Moore (1982) for successful GED graduates in Kentucky. Anecdotal 
accounts in the literature also speak of enhanced employment status, opportunities, 
and earnings (Harris, Hedman, & Horning, 1983; Witter, 1978). 


Acceptance of high school equivalency credentials is not as straightforward as it 
might seem, however. Cervero and Peterson (1982), based on a survey of 1906 GED 
candidates who wrote in 1980, found discrepancies between expected training and 
employment benefits of the candidates and those they actually claim to have re- 
ceived. Of those respondents who expected that GED credentials would help them 
to quality for a job, 25% reported that the credential was of no assistance. This 
finding was consistent with the earlier research of Dhanidina and Griffith (1975) 
who concluded that, while the GED may be seen as an investment by participants, 
little evidence exists to suggest that the credential pays off in the job market. A 
similar discrepancy between the expectation of being admitted to an educational 
program on the basis of GED certification and actual acceptance on this basis was 
also noted by Cervero and Peterson (1982). 


Benefits, other than economic, have been detected by some researchers. 
Improved self-image and self-satisfaction have been most prominently mentioned 
(Cervero & Peterson, 1982: Fischer & Evanson, 1979: Harris, Hedman, & 
Horning, 1983; Waller, 1976). 


Studies of American equivalency programs have also considered the personal 
and demographic attributes of program participants. These studies have revealed 
that most candidates are female (Malizio & Whitney, 1981), experience a higher 
level of unemployment than the national average (Behal, 1983), and have between 
nine and ten years of formal schooling (Malizio & Whitney, 1981; Moore, 1982; 
Swarm, 1981). Reasons for participating in an equivalency program are generally 
related to a desire to improve the participant’s job situation or to the educational 
background of the participant (Malizio & Whitney; Swarm, 1981). Characteristics 
such as age, sex, and highest level of education attained also influence the 
motivations of those seeking an equivalency diploma (Cross, 1981; Harvey, 1978; 
Russ-Eft & Steel, 1980). One study has suggested that age and highest level of edu- 
cation are uncorrelated with GED test performance (Modjeski & Michael, 1978). 


The studies of high school equivalency programs in Canada have largely been 
carried out by sponsoring government agencies and have shown mixed results. 
While other provinces have equivalency programs, only in Saskatchewan and Mani- 
toba has there been any attempt to collect evaluation evidence from program par- 
ticipants. Unlike Alberta, neither of these two provinces has based an equivalency 
diploma on the assessment of alternative, educationally-relevant experiences. 


Wedel (1974), in a survey of Saskatchewan equivalency program participants, 
found large Canadian national firms and federal government agencies reluctant to 
consider equivalency credentials as equivalent to regular high school diplomas. 
However, more recent surveys in Saskatchewan have reported improved acceptance 
of the equivalency credential (Province of Saskatchewan, 1977; 1981). A survey of 
Manitoba employers (Manitoba Department of Education, 1981) has also suggested 
a reluctance on the part of some employers to accept the equivalency credential. 
A 1982 Manitoba Department of Education study showed that, while 58% of the 
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respondents expected to gain admission to post-secondary education, only 39% did. 
A similar finding is reported by the Province of Saskatchewan in a 1982 survey of 
equivalency program participants. Program participants in both Manitoba and Sas- 
katchewan were found to be less than 25 years of age, saw the GED tests as a way 
of increasing job and promotion opportunities, and reported improved self-image as 
the greatest program benefit (Manitoba Department of Education, 1982; Province 
of Saskatchewan, 1982). 


Method 


Between December 1981 and December 1983, 4333 applicants successfully 
passed the GED tests in Alberta and received a high school equivalency diploma; 
802 applicants did not pass the tests. During the same period, 822 applicants re- 
ceived the diploma through the alternate route. From each of these three popula- 
tions (GED-pass, GED-fail, and non-GED) random samples of 200 were selected 
from lists and files provided by Alberta Education. 

Following a field trial of questionnaire items involving eight program appli- 
cants, two survey instruments were developed (one for the GED groups and a 
slightly different one for the non-GED group). These instruments addressed the 
following questions: 


1. What are the educational, vocational, and demographic characteristics of pro- 
gram applicants? 


Why did people apply to the program? 
What educational and employment benefits were anticipated? 
What benefits were realized? 


oto Boat 


How do the perceptions relate to group membership? 


The instruments were mailed with a personalized covering letter and a 
stamped, addressed, return envelope to the individuals selected for study. Each in- 
strument was numbered to assist in the mailing of a reminder letter to 
non-respondents. The purpose of the numbering was explained in the covering let- 
ter. A reminder letter was mailed one month later to the non-respondents. Of the 
600 questionnaires mailed, 75 or 12.5% could not be delivered. The return rates, 
which are shown in Table 1, compare favorably with that of Cervero and 
Peterson (1982) who reported a rate for their survey of 24% for their survey. 


Data analysis of the surveys was undertaken in three stages. First, response tre- 
quencies for each of the items in the two instruments were determined. Second, 
items to which large numbers of individuals across the three groups responded 
were subjected to cross-tabulation analyses to investigate relationships between 
and among the demographic and group membership variables and the perceptions 
of respondents concerning program expectations and benefits. Chi square tests of 
independence were used to determine the statistical significance of the relation- 
ships. The final stage of the data analysis involved a study of the relationship of 
the demographic characteristics of the GED groups and performance on the GED 
tests. One-way multivariate analysis of variance was used to test the significance of 
the relationships. 


Results 
Background Characteristics 
Sex, age, and highest grade summaries are reported in Table 2. It is interesting 
to note that the GED-fail group was significantly younger (p<.05) than either of 
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TABLE 1 
Population Sample Deliverable Returned Percentage 
Size ; Returned 
(Ret. + Del.) 
GED-pass 4333 200 168 113 67% 
GED-fail 802 200 167 ha 46% 
non-GED 822 200 190 133 70% 
TABLE 2 
Characteristics of Equivalency Program Applicants 
Characteristic Group 
GED-pass GED-fail non-GED Total 
Sex 
Percentage Male 46.9 46.8 48.9 AT, 
Percentage Female 53.1 53.2 51.1 52.3 
Mean Age in Years S2h7 28.3 31.6 31.2 
Mean Highest Grade 10.0 9.8 iT ide: 10.5 
Completed 


the other two groups. This finding suggested that individuals who were closer in 
time to having attended school were also the least successful in terms of the GED 
experience. 


A significant difference among the three groups was also found for the highest- 
grade completed variable (F[2,320] 69.71, p<.001). Post-hoc Scheffe contrasts of 
group means revealed significant differences between the non-GED group and the 
two GED groups (p<.05). 

In general, the AHSEP participants were found to be similar to individuals who 
have participated in other equivalency programs. In particular, those who had writ- 
ten the GED tests displayed characteristics similar to GED candidates from other 
states and provinces. For example, the mean grade level completed by the Alberta 
sample was found to be grade 10, a level widely reported in the literature for differ- 
ent GED populations (Moore, 1982; Malizio & Whitney, 1981; Swarm, 1981). 


Reasons for Applying to the Program 


Most applicants, regardless of the group considered, applied to the program to 
improve their educational opportunities or to improve their chances of getting a 
better job. Individuals in the non-GED group cited “personal reasons” more often 
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than those in the other groups as a rationale for applying to the equivalency pro- 
gram. These findings are similar to those reported elsewhere (Cervero & 
Peterson, 1982; Province of Saskatchewan, 1982; Malizio & Whitney, 1981; 
Swarm, 1981). 


It was not possible to distinguish the responses of subjects to questions dealing 
with their reasons for applying to the program based upon the highest grade they 
had completed. The age category to which program applicants belonged, however, 
seemed to influence the reasons they gave for applying to the program. In general, 
younger applicants were more interested in job and educational opportunities while 
older applicants were participating to a greater extent for personal reasons. Re- 
spondents who were less than 40 years of age were more likely than those over 40 
to have applied to the program to improve their educational opportunities 
(X?[6,n =323]=13.54, p<.05) while those in the 19 to 25 age group were more 
likely than any of the other groups to have participated to improve their chances of 
getting a better job (X?[6, n=323]=17.59, p<.01). Divergent findings concerning 
the reasons that people of various age groups give for wanting to have an 
equivalency diploma were not unexpected; Harvey (1978), Russ-Eft and Steel 
(1980), and Cross (1981) have all found differential motivation in adult education 
programs to be a function of age. 


Expectations and Benefits 

Discrepancies between expectations and benefits were found for all three 
groups. This finding is again consistent with similar findings reported in the liter- 
ature (Cervero & Peterson, 1982; Malizio & Whitney, 1981; Province of 
Saskatchewan, 1982; Swarm, 1981). As can be seen in Table 3, violation of expecta- 
tions occurred for all groups in the areas of employment and education. For the 
non-GED group, expectations were not severely violated in relation to personal 
benefits, while for the two GED groups the results in the personal area were 
consistent with the results in the other two areas. Table 3 suggests that partici- 
pants were disappointed in the benefits of the program, expecting much more than 
they actually received. The trends of disappointment conveyed in the table, even 
though based on “prior expectations” which were assessed after the fact, seem un- 
deniable. 

To check out these trends, respondents were asked if they had participated 
either full-time or part-time in an educational program since their AHSEP experi- 
ence. Twenty-nine of the non-GED group, 20% of the GED-pass group and 14% of 
the GED-fail group had received or were receiving further education. Fewer than 
35% in each group had even applied for admission to some kind of training pro- 
gram (GED-pass: 32%; GED-fail: 29%; non-GED: 34%). 

Employment patterns for the three groups were also determined through the 
questionnaires. All groups experienced slight increases in paid employment since 
participating in the program. Unlike the Behal (1983) study in which high levels of 
unemployment were reported for all GED candidates, the present study found only 
the GED-fail group to have an unusually high level of unemployment. The 
responses on employment are shown in Table 4. 

If only the unemployed who sought work are considered, the GED-fail group 
has an unemployment rate that is over twice as great as the other two groups. 
Moreover, when prior and post comparisons of employment status are made, only 
the GED-fail group showed an increase in the percentage of those unemployed but 
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TABLE 3 


Comparison of Prior Expectations and Post Program Benefits 


(Figures in Table are Percentages of Respondents) 


Area of Expectation GED-pass GED-fail non-GED 
Prior Post Prior Post Prior Post 
n=113 n=77 n=133 
Help in getting a job 43 as 53 4* 33 8* 
Help in a personal way 40 Males 33 fle 29 23 
Help in being accepted 38 2% 40 125 38 16* 
into an educational 
institution 
Make no differences to 20 46* 8 34* paps alin 
job situation 
Help in getting a 15 eas i it 20 sg 
promotion 
Help in being accepted 13 2 29 1 18 4* 
into an apprenticeship 
program 
Help in keeping present 5 4 1 3 8 3 
job 
Help in getting raise 4 4 0 1 3 3 
Make no difference to 4 il 5 16* 4 6 
educational opportunities 
Not help in any way 2 Zils 0 39* 5 ay 
* change of more than 10% 
TABLE 4 
Employment Status 
(Figures in Table are Percentages) 
GED-pass GED-fail non-GED 
n=133 ny n=133 

Prior Post Prior Post Prior Post 
Employed 
Full-time for pay 53 58 36 43 55 58 
Part-time for pay 13 le 12 8 i. 13 
Unemployed 
Not looking for work 8 3 16 7 9 i, 
Looking for work 16 15 23 33 18 14 
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looking for work. Such a prior-post finding for the GED-fail group is not surprising 
for at least two reasons. First, the GED-fail group tended to be younger than either 
of the other groups and unemployment rates have traditionally been highest, ac- 
cording to Statistics Canada, among young people. This youngest group also has 
less job market experience, an important employment factor, as Dhanidina and 
Griffith (1975) have pointed out. Second, according to Statistics Canada, general 
levels of unemployment in Alberta gradually rose from 4.5% in December 1981 to 
11.1% in December 1983, the time period covered by the study. 


Levels of unemployment for the GED-pass and non-GED groups declined 
slightly. Thus, at a time when Alberta unemployment levels were increasing, the 
unemployment rates of some equivalency program participants actually dropped. It 
may be that acquiring an equivalency diploma helped some individuals to find new 
employment. It is also possible that the shifts in employment data for these groups 
may reflect a more general motivational factor rather than a causal factor. 

Exploratory analyses of the employment data, however, suggested no 
statistically significant differences between the prior and post employment status 
of any of the three groups [GED-pass (X?[25, n= 113]=167.2, p<.01 ); GED-fail 
(X°?[16, n= 77]=54.8, p <.01); non-GED (X?/20, n= 133]=208.9, p<.01)]. These 
findings suggest that there was no change in the employment status of the GED- 
fail group, in spite of the observed percentage increase of those in this group who 
were unemployed but looking for work. 

Personal benefits outweighed educational and job-related benefits according to 
the AHSEP participants. Self-satisfaction was the most frequently reported per- 
sonal benefit. Positive comments concerning the equivalency program by even 
those who had failed the GED tests focused on personal benefits rather than edu- 
cational or job-related benefits. 


Relationships Between GED Test Performance and 
Selected Demographic Variables 

As reported earlier, preliminary analyses of the survey data revealed that the 
GED-pass and GED-fail groups could be distinguished on the basis of age but not 
on the basis of previous highest level of education. To gain a more complete picture 
of the relationship of these variables to GED test performance, multivariate ana- 
lyses of respondents’ test scores were conducted for both the age and highest- 
grade-completed variables. 

Table 5 provides a breakdown of GED sub-test means for four age groups (19- 
25, 26-30, 31-40, 41 and older) of GED candidates. The ordering of sub-test means 
is consistent with the finding that most GED failures in this study were from the 
youngest age category. 

A one-way multivariate analysis of variance (MANOVA) of the vectors of the 
five GED sub-test means for the four age groups also supports this finding. The 
one-way MANOVA resulted in an overall significant difference among the four 
groups (Wilk’s Lambda = 0.839, Rao’s approximate F (15, 502.8)=2.20, p<.01). 
Contrasts of the vectors of sub-test means revealed that the scores of the 19 to 25 
year-old group were significantly different (p<.05) from those of the other age 
groups. No other age group differences were found. 

A similar analysis using four grade-level groups (grades 9, 10, 11, 12) resulted in 
no significant differences (Wilk’s Lambda = 0.874, Rao’s approximate F (15, 
502.8)=1.68, p<.05). Inspection of the sub-test means for males and females re- 
vealed very small differences (usually less than 0.5 standard score points) and 
suggested that further multivariate analysis was not appropriate. 
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TABLE 5 


Age Group 


GED Sub-test Means! 


Writing _ Social Science Reading Math 

Studies 
25 or less (n=69) 48.4 48.6 50.6 51.2 48.0 
26 to 31 (n=39) 54.6 07.5 56.8 57.6 53.0 
31 to 40 (n=51) Zen 55.9 55.8 57.5 53.1 
Over 40 (n=31) 51.5 54.9 53.7 54.8 52.4 


‘Scores reported in the table are standard T- scores (u=50, o= 10). 


Discussion 


Given that the goals and objectives for the Alberta High School Equivalency 
Program are not very different from those of other equivalency programs, it was 
not surprising that the characteristics of the Alberta equivalency population and 
other equivalency populations described in the literature were found to be similar. 
The reasons given for applying to the equivalency progam, prior expectations, and 
benefits reported by the sample of AHSEP participants were not unlike those of 
other equivalency programs. 


It is evident that individuals who have the most difficulty in passing the GED 
tests are young, have low levels of formal education, and are often unemployed. For 
the Alberta equivalency population, age seems to be related to GED test perform- 
ance, a finding not supported in the literature (Modjeski & Michael, 1978). Also, 
GED failures are most often motivated by the belief that a credential will help 
them to get a job or to go on to some form of post-secondary education in order to 
improve their job situation. 


The differential relationship between age and reasons for participation in the 
equivalency program is an interesting one. Older individuals in this study were mo- 
tivated to participate in the equivalency program because, generally, they saw it as 
being personally rewarding. Younger participants, on the other hand, were more 
pragmatic; upon completion of the program, they hoped to get a better job. This 
finding reinforces Cross’s (1981) contention that the reasons adults participate in 
education programs “correspond consistently and logically to the life situations” 
(p. 91) of those individuals. Based on the findings of the present study, more exten- 
sive investigations of the relationship of age and motivational factors to GED test 
performance are in order. 


As in similar programs, there was a tendency to inflate the value of the 
equivalency diploma in terms of its utility in the educational and job marketplaces. 
Many participants saw the equivalency credential as a way of overcoming all bar- 
riers in their attempts to find employment or acceptance to post-secondary educa- 
tion. This may be one reason why discrepancies between prior expectations and 
post-program benefits, similar to those reported by Cervero and Peterson (1982), 
were found. The fact that the employment situations of both the GED-pass and 
non-GED groups remained stable even at a time when unemployment was on the 
rise, has not tempered the disappointment in the benefits of the program for indi- 
viduals in these groups. 
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It appears reasonable to suggest that retrospective judgments of personal bene- 
fits outweighed educational or job-related benefits. Self-satisfaction was the most 
frequently reported personal benefit. This finding is related to but somewhat dif- 
ferent from findings reported in the literature. Enhanced self-image or self-confi- 
dence is the most frequently mentioned personal benefit of equivalency program 
participation mentioned in the literature (Cervero & Peterson, 1982; Fischer & 
Evanson, 1979; Harris et al., 1983; Waller, 1976). The discrepancy between the 
present study and those just cited may have arisen as a result of different data col- 
lection procedures. GED surveys most often directly ask candidates if they feel 
that their self-confidence or self-image has improved as a result of the GED experi- 
ence thereby increasing the likelihood of identifying this benefit. In the present 
study, participants were free to volunteer how they believed the equivalency pro- 
gram had helped them. The importance of self-concept was evident for the Alberta 
equivalency program in the same way as it has been demonstrated to be an impor- 
tant component of adult education programs in general by Wilson (1979; 1980) and 
Rolfe and Wilson (1979). 


While the extent to which the Alberta High School Equivalency Program assists 
individuals in meeting their vocational and personal educational goals must be 
questioned, the value of personal benefits received by participants should not be 
underestimated. Psychological benefits may have assisted some of the participants 
in finding a new job or may have provided the impetus for others to apply to a 
post-secondary institution. It may be that the credential itself is less important 
than the positive outlook created by the success in acquiring it. 


Note 


1 This research was supported in part by a grant from the Alberta Advisory Committee for Educational Studies. 
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BOOK REVIEWS 


CURRICULUM CANADA V. Edited by Douglas A. Roberts and John O. Fritz. 
Vancouver, B.C: Centre for the Study of Curriculum and Instruction, University 
of British Columbia, 1984, 215 pp. 


Every year, the Canadian Association for Curriculum Studies sponsors an 
invitational conference dealing with curriculum concerns in the Canadian context. 
The papers presented at these conferences are published by the Centre for the 
Study of Curriculum and Instruction at the University of British Columbia. Curric- 
ulum Canada V is the fifth in the series. 

It is worth noting at the outset that these volumes, taken together, provide a 
valuable resource for scholars in the field of Curriculum Studies. The papers not 
only address current issues in the field but do so from a Canadian perspective. This 
perspective had been noticeably missing from the literature in previous years. 

When a volume to be reviewed consists of conference papers, one must ask: 
How well did the papers achieve the purpose of the conference? That is the ques- 
tion which will be addressed in this review. 

The conference resulting in Curriculum Canada V recognized that there are 
two groups of stakeholders in the field of Curriculum Studies: subject area 
specialists (e.g., reading, mathematics) and generalists whose concerns cross subject 
matter boundaries. As the editors of this volume write: “Clearly the relationship 
between the two needs exploration.” Therefore, they explain, “The major objective 
of the conference was to clarify issues involved in relating the school subject area 
research to more general curriculum/instruction issues.” 

The conference was organized around presenters representing six major school 
subjects: science, reading, social studies, language arts, mathematics, and second 
languages. The presenters approached the conference theme through a “state of the 
art” appraisal in each of the six areas. Each presenter prepared a paper in advance 
of the conference, and a response was prepared by another scholar in the same 
subject area. 

The conference began with a keynote address by Dr. George Tomkins on the 
history of curriculum in Canada. This paper provides an excellent overview of the 
trends which have brought curriculum in Canada to its present state. Tomkins’ 
paper says basically “Here’s where we’ve been.” The other papers go on to say 
“Here’s where we are now.” 

The papers devoted to specific subject areas describe current research in those 
subject areas as well as the issues involved. This is accomplished in an interesting 
and informative manner. The respondents’ papers broaden the reader’s thinking 
either by agreeing with the first author and providing additional support or by dis- 
agreeing and presenting the reader with another point of view. Both approaches 
prove to be valuable. 
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However, the presenters were also expected to “address themselves to issues of 
general concern for curriculum/instruction thought.” It is this link between subject 
area concerns and broader curriculum concerns which is less evident in the papers. 
Although some of the authors touched on general curriculum concerns, they seldom 
identified them as such. A notable exception is the paper by Ken Osborne which 
concludes with a section on “Social Studies and General Curriculum Theory.” 

One wonders if the presenters were in a position to relate the issues in their 
subject areas to “general” curriculum theory. They are undoubtedly experts in their 
subject areas, but they are not necessarily experts in “general” curriculum theory as 
well. Perhaps they were being asked to do something which they were not equipped 
to do. 

The problem may lie, not with the presenters, but with the organization of the 
conference. In order to bridge the gap between subject area specialists and general 
curriculum specialists, representatives of both groups would need to talk together. 
It is interesting, then, that the presenters and respondents were all subject area 
specialists. Members of the generalist group were apparently not asked to partici- 
pate. 

Their participation, however, would have been useful. For example, if the re- 
spondents to the subject-oriented papers had been generalists, the focus of their re- 
marks could have been “Here’s how general curriculum research can contribute to 
your field” or “Here’s how the research which you have described in your field can 
contribute to general curriculum theory.” Then a meeting of the minds could have 
been broached between the two groups. 

As it is, Curriculum Canada V is a useful volume from the standpoint of cur- 
rent research in the various subject areas. The state of each field is presented well. 
However, a discussion between subject area specialists and general curriculum 
specialists is not to be found in this particular collection of papers. It is hoped that 
a future conference will initiate such a dialogue. 


Jean H. Young 


Department of Elementary Education 
University of Alberta 
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THE EDITOR’S PAGE 


Admonitions for Authors 


In publishing Volume XXXI we used about one-quarter of the manuscripts 
submitted to us. Those which were not used exhibited a number of common char- 
acteristics. Reflecting on these characteristics gives rise to some gentle reproofs for 
prospective authors. 


Don’t raise doubts about your reliability. Typographical errors, incomplete ref- 
erences, excessive use of abbreviations, and overuse of one type of punctuation, 
such as dashes, create subliminal doubts about an author’s reliability. These doubts 
are intensified by inconsistencies in the levels of headings, expressions of numbers, 
and treatment of quotations. 


Don’t ignore instructions about the preparation of manuscripts. Submitting a 
single copy of the manuscript without an abstract and a title page, or one which 
uses the wrong style for references and tables, indicates unfamiliarity with the 
guidelines about form and style included on the inside cover of the journal. 
Sending it to a previous editor suggests that you are not a reader of AJER. Do you 
expect to get published in a journal you do not read? 


Don’t provoke questions about the care taken in conducting the research. Con- 
clusions unsubstantiated by the evidence presented, data in tables which do not 
agree with the text, and citations in the text not included in the references raise 
questions about the trustworthiness of the report. Needless jargon, garbled syntax, 
and ungrammatical constructions also arouse suspicions of muddle-headedness. 


Defects like these result in the manuscript being returned unreviewed, some- 
times with an invitation to submit a revised version. A further caution grows out of 
the review process. 


Don’t evidence the weaknesses frequently identified by reviewers. An introduc- 
tion which does not make apparent the conceptualization and rationale for the 
study, or which does not demonstrate the continuity between the study and previ- 
ous work, is unacceptable. Similarly unacceptable is a discussion section which 
fails to clarify the similarities and differences between the results of the study and 
the work of others, or omits an assessment of the contribution of the study. 
Neglecting to provide a clear summary of what was done and how it was done, or to 
establish the credibility and dependability of the data collection procedures, elicits 
criticism bordering on scorn from many reviewers. Likewise disdained is the report- 
ing of data in insufficient detail to justify the conclusions. 

How does one heed these admonitions and get an article published? Have the 
initial version critiqued by an associate, then rewrite it and have someone proof- 


read it. After further revisions and proofreading, submit it. 
WHW 
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LAWRENCE M. BEZEAU 


University of New Brunswick 


Level and Inequality of Per Pupil 
Expenditure as a Function of Finance 
Centralization 


The effect of centralized revenue collection was tested empirically using a sample 
consisting of the American states and Canadian provinces, while controlling for fed- 
eral jurisdiction and northern state or territory. It was found that centralization had 
no effect at all on the level of per pupil expenditure but that relatively less wealthy 
jurisdictions had more centralized systems. Centralization appeared to reduce the in- 
equality of per pupil expenditure but the magnitude of the effect was small. The cen- 
tralization variables accounted for only seven percent of the variance of the 
coefficient of variation of per pupil expenditure. One of the control variables, annual 
per capita personal income, had a strong relationship with the level of per pupil ex- 
penditure but was not related to the variation of per pupil expenditure. Each 
additional dollar of personal income per capita resulted in an additional 20 cents of 
per pupil expenditure. 


This article examines the level and inequality of annual per pupil expenditure 
in the public school systems of American states and Canadian provinces as a func- 
tion of the degree of centralization of financing. In recent decades many states and 
provinces have assumed a greater share of the financial burden of operating schools 
frequently with the intention of reducing expenditure disparities among school dis- 
tricts. In other cases the intent has been to increase the amount of financial re- 
sources available for education across a state or province. Unfortunately, there is 
very little evidence to suggest that centralization will achieve either of these objec- 
tives. As well there are funding methods available that do not involve centralization 
and that have the same objectives. There are competing theoretical positions, 


outlined below, as to whether centralization will increase or decrease the level of 
expenditure. 


Nevertheless, centralization of funding, especially full state assumption, has fre- 
quently been perceived as a panacea for interdistrict inequality. Several examples 
illustrate this. In 1963 in the Province of New Brunswick, the Byrne Commission 
(Byrne, Andrews, Boudreau, Nadeau, & Wilson, 1963) recommended that the Prov- 


Dr. Bezeau is with the Faculty of Education specializing in educational administration and finance. 
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Level and Inequality 


ince assume all local costs of education. This recommendation was implemented in 
1967 making New Brunswick the second full state assumption jurisdiction in North 
America. In 1969 the United States Advisory Commission on Intergovernmental 
Relations decided that full state financing was the most likely way to attain equal 
educational opportunity. In 1972 in the State of New York, the Fleischmann Com- 
mission (The Fleischmann Report, 1973) recommended that the State move to full 
state assumption over a period of three years. This recommendation has not yet 
been implemented and appears unlikely to be implemented in the near future. 


There has been some comment from the academic community on full state as- 
sumption but as Benson (1975, p. 7) acknowledges, currently available examples 
may not be very instructive. Much of this comment is necessarily speculative. After 
examining various funding schemes other than full state assumption, Cooper (1971, 
p. 351) concluded that, “Unless our national conscience can come to tolerate a pro- 
liferation of enclaves of educational privilege alongside enclaves of underprivilege, 
we must move in the direction of state assumption of the full costs of elementary 
and secondary education.” The few empirical studies of full state assumption 
jurisdictions indicate that even centralization taken to its logical extreme has not 
entirely eliminated inequity (Hight, 1974). 

This study has both a theoretical and a practical purpose. On a practical level, 
we need to know if centralization has achieved the objectives that decision-makers 
expected. Is it positively related to higher expenditure levels and greater equality 
of expenditure? On a theoretical level the study addresses the question of which of 
the two theories best explains the empirical results. The spillover model, in partic- 
ular, has recently been employed to explain economic and financial phenomena in 
the area of education. This is a departure from its traditional role of explaining rel- 
atively pure public goods such as defence and justice. 

This study employed cross-sectional data for the 1979-80 school year to explore 
the two relationships posed above. In practice the centralization of revenue collec- 
tion is often accompanied by other changes which frequently include the centraliz- 
ation of control and administration; but only the issue of finance is addressed here. 
In particular, issues of allocative efficiency, community preferences, local control, 
and organizational accommodation are not dealt with. These issues are discussed to 
a limited extent in Wallschlaeger (1973) and Johns (1973). 


Competing Models of Centralization 


Two theoretical models produce opposite predictions of the effect of centraliza- 
tion of revenue collection on the level of per pupil expenditure. The spillover model 
predicts that higher per pupil expenditure will result from greater centralization 
while the personal service model predicts higher expenditure as a consequence of 
greater decentralization. Centralization is often seen as a means of obtaining 
greater equality of per pupil expenditure, but there are financing systems designed 
to achieve the same result without centralization. 

The model which is here referred to as the spillover model is based on the pre- 
sumed public-good properties of education (Brazer, 1970). Education is assumed to 
have benefits that cannot be captured by the individual receiving the education. 
These are difficult to specify but may include good citizenship, neighborliness, 
lowered propensity to commit crimes, national unity, social cohesion, and more ef- 
fective raising of children. Society as a whole receives these benefits and the 
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individual responsible for the benefit cannot charge others for his or her contribu- 
tion. Since educated individuals do not receive all the benefits of their education, if 
they are left to finance their education from their own resources they will 
underinvest. Milton Friedman (1962, pp. 85-107) refers to these benefit spillovers 
as neighborhood effects and uses them to justify some government intervention in 
financing education. To prevent such underinvestment the community must organ- 
ize, impose taxes on itself, and subsidize education as most communities do. 


Although the community receives the public benefits of the individual’s educa- 
tion, it may not be the same community as the one that subsidized this education. 
Some of the benefits accrue to the larger community: the state or nation. Also indi- 
viduals are geographically mobile; they may leave the local community that edu- 
cated them, depriving it of the public benefits of their education. Individuals are 
not as likely to leave their state of origin and even less likely to leave their country. 
Thus many of the benefits that are external to the local community are internal to 
the state or nation. The benefits that arise from costs borne in a given jurisdiction 
that are external to the jurisdiction are called benefit spillovers. Education benefit 
spillovers discourage a jurisdiction from investing in education. States and nations, 
because they are able to internalize a higher proportion of the benefits, can be ex- 
pected to invest more heavily in education. Thus we would expect to find those 
states and provinces with greater state or provincial and federal financial involve- 
ment to spend more on education and those with greater local school board finan- 
cial involvement to spend correspondingly less. 


A recent study by Hadley (1985) employed the spillover model with American 
states to predict that educational expenditure would be affected by migration pat- 
terns in and out of the individual states. He controlled for per capita income and 
found that this variable had a very strong effect on per pupil expenditure but that 
the migration variables made no significant difference. He reanalyzed the data of 
an earlier study that had reached the opposite conclusion and found that the ap- 
parently contradictory results were caused by a high correlation between migration 
rates and per capita income which made it difficult to distinguish between the two. 


The personal service model assumes that education is something that individ- 
uals will invest in more heavily if they can invest in themselves or in their local 
community rather than in a larger aggregate. This model has most frequently been 
used to characterize health services and to explain low service levels in countries 
with government-financed health insurance. Individuals are believed to be more 


willing to spend money on their own health than on the health of the nation as a 
whole. 


Hickrod (Final Report, 1973, p. 105), in a dissenting note to a report recom- 
mending full state assumption for the State of Illinois, described a mechanism 
whereby decentralized revenue collection may contribute to higher levels of per 
pupil expenditure. Wealthier districts increase their expenditures which motivates 
the educational community to pressure state governments into helping the poorer 
districts to catch up with them. With full state assumption this mechanism would 
disappear, perhaps causing the state to construe uniformity as adequacy. 

Lake (1983, 1982) implicitly applied the personal service model to education in 
Suggesting that full state assumption in the Province of New Brunswick has re- 
duced the sufficiency of education finance in comparison with Nova Scotia. His 
sample of only four jurisdictions did not permit him to test this hypothesis, but he 


and others before him have noted low per pupil expenditures in full state assump- 
tion jurisdictions. 
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With respect to the effect of centralization on equality of per pupil expenditure, 
there are expectations that the way to reduce inequality is to centralize revenue 
collection at the state or provincial level and to hand out the revenue equitably 
without regard to local ability to pay. It is nevertheless possible to equalize the 
ability to pay without centralization. Revenue sharing schemes such as guaranteed 
valuation, percentage equalizing, and especially district power equalizing, permit 
equalization of the ability to finance education without necessarily centralizing rev- 
enue collection (Jargowsky, Moskowitz, & Sinkin, 1977). Using a power equalizing 
approach it is possible, in theory at least, to achieve complete interdistrict equality 
of financial ability without any state or provincial contribution at all. This is done 
through recapture provisions with redistribution of recaptured revenue in such a 
way that a simple relationship exists between the district choice of tax rate and a 
level of per pupil expenditure, a relationship that is the same for all districts in the 
jurisdiction. The level of per pupil expenditure is determined only by the 
willingness of the district to tax itself and not by its ability to pay. 


Sample 


The unit of analysis is the highest subnational jurisdiction, the state in the 
United States and the province in Canada. Because of the similarity of education 
finance in the two countries, the states and provinces were analyzed together when- 
ever data availability permitted. Both countries are federations with the states and 
provinces as the governmental units constitutionally responsible for education. 
Much of this responsibility has been delegated to locally-elected school boards 
which have the authority to levy a tax on real property in most states and prov- 
inces. In both countries the federal government makes categorical grants to elemen- 
tary and secondary education, but this is the level which provides the least amount 
of money. 

The two countries together provided a sample size of 62 jurisdictions. This in- 
cluded all American states and the District of Columbia. The District of Columbia 
was treated as a school district without a state and Hawaii, the only full-state-as- 
sumption jurisdiction in the United States, as a state without school districts. All 
ten Canadian provinces were included but the two Canadian territories were 
lumped together as one jurisdiction. The two full-state-assumption jurisdictions in 
Canada, New Brunswick and Prince Edward Island, have systems of school dis- 
tricts and so were treated normally. 

Data were obtained from published and unpublished sources, the National 
Center for Education Statistics in the United States and from Statistics Canada in 
Canada. NCES provided data disaggregated by school board whereas financial data 
for individual boards were not available for Canada. This prevented the calculation 
of inequality measures for Canadian provinces. All 62 jurisdictions are included in 
the level of expenditure analysis but only 49 American jurisdictions are included in 
the inequality analysis. Of the American jurisdictions, the District of Columbia and 
Hawaii were excluded from the inequality analysis because without two or more 
school districts, it is impossible to calculate a measure of inequality. 
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Methodology 


Multiple linear regression was used for both the level of expenditure analysis 
and the inequality analysis. The dependent variable in the level of expenditure an- 
alysis was the pupil-weighted mean of annual per pupil expenditures of school dis- 
tricts in each jurisdiction. The dependent variable in the inequality analysis was 
the pupil-weighted coefficient of variation of this mean. Per pupil expenditure was 
defined as total recurrent expenditure exclusive of food services and transporta- 
tion, which corresponds to the National Center for Education Statistics concept 
of core current expenditure. The coefficient of variation was calculated by dividing 
the weighted standard deviation by the weighted mean. Although numerous other 
measures of inequality are possible, the voluminous literature on this topic 
(Bezeau, 1979; Love & Wolfson, 1976; Sen, 1973) suggests that the coefficient of 
variation is free of the undesirable properties that characterize many popular meas- 
ures while being relatively easy to understand and explain. In the calculation of 
these measures, the several hundred school boards in North America that send all 
their students to other boards for education were excluded in order to avoid double 
counting. All dollar amounts used were current and were expressed in United 
States dollars for both countries. 


The independent variables were classified as control variables and experimental 
or centralization variables. The control variables were federal jurisdiction, northern 
jurisdiction, and annual personal income per capita. The centralization variables 
were proportions of revenue provided by the local, the state or provincial, and the 
federal levels. 


In general, the control variables describe phenomena that are likely to influence 
the dependent variables but are logically and temporally prior to them. Federal 
jurisdiction was used to control for the higher average per pupil expenditure in 
Canada. The northern jurisdiction variable controlled for the much higher per 
pupil expenditure in the Canadian territories and Alaska. Annual personal income 
per capita was used as a measure of the ability of the jurisdiction to finance educa- 
tion. In studies reviewed by Hadley (1985), this variable is shown to be superior in 
predicting educational expenditures than other related ones such as income per 
child in public school. 


The three centralization variables must be classified as representing centraliza- 
tion or decentralization in order to structure the regression equations and interpret 
the results. Because of the minimal level of participation of the federal level in 
both countries, centralization is seen primarily as a question of local versus state or 
provincial contribution. The state proportion, therefore, represents centralization 
and the federal proportion must also represent centralization. This leaves the local 
proportion to represent decentralization. Since the three proportions add to one, it 
was not possible to enter all of them into one regression equation. Local proportion 
was entered into the first equation and state and federal proportion were entered 
into the second equation. 


The regressions were unweighted, the rationale for this being that the inde- 
pendent variables of concern, the centralization variables, characterize jurisdictions 
rather than pupils, revenue amounts, or populations. Because of this, caution must 
be exercised in interpreting some of the descriptive statistics. The arithmetic 
means in the tables are averages across jurisdictions. To obtain a revenue propor- 
tion for Canada and the United States together, for example, it would be necessary 
to weight each jurisdiction’s proportion by the total revenue for education of that 
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jurisdiction. This was not done. Likewise to obtain the mean per pupil expenditure 
for all pupils in the sample it would be necessary to weight each jurisdiction’s per 
pupil expenditure by the number of pupils. 


Additional analyses, not reported here, were performed using several nonlinear 
models and various standard variable transformations. These analyses included the 
calculation of elasticities. In no case were the results superior to those produced by 
the relatively simple linear model reported here. 


Level of Expenditure Results 


Results for annual per pupil expenditure as the dependent variable are given in 
Tables 1 and 2. First order relationships are examined first as they provide infor- 
mation which is required to interpret the regression results. The control variables 
show the expected correlation with the dependent variable; northern and Canadian 
jurisdictions have a higher annual per pupil expenditure and those with a higher 
per capita personal income spend significantly more on education. Unexpectedly, 
personal income correlates strongly with two of the centralization variables. Local 
proportion correlates positively with personal income and state proportion corre- 
lates negatively. We can conclude that those states and provinces with lower per- 
sonal income per capita are the ones that have chosen to centralize education 
finance. The wealthy jurisdictions have retained decentralized systems. In view of 
the fact that the wealthy jurisdictions also spend more on education, it is of crucial 
importance in the regression analysis to control for some measure of wealth before 
judging the effect of the centralization variables on per pupil expenditure. 


The federal proportion, which averages less than 10%, is not related to personal 
income but the overall correlation coefficient hides important differences between 
the two countries. The coefficient for personal income for the United States is 
-0.41, suggesting a federal policy of using categorical grants for education for 
equalization purposes. The coefficient of 0.04 for Canada indicates no such policy. 
Federal proportion has a significant negative correlation with per pupil expenditure 
even though the state and local proportions do not correlate at all with this vari- 
able. 

The signs on the regression coefficients in Table 2 are consistent with the per- 
sonal service model, but the standard errors of the centralization variables are far 
too high to permit this conclusion or the opposite conclusion. The regression itself 
is highly significant with almost 80% of the variance in per pupil expenditure pre- 
dicted by the independent variables, but the three control variables (NORTH, 
USA, and PINC) account for slightly more than 79% leaving less than one-half 
percent for the centralization variables. An examination of regression results and 
residuals for individual jurisdictions supports the conclusion forced upon us by the 
regression equations, that centralization of revenue collection or the lack of it has 
no effect on the level of per pupil expenditure. 

Personal income per capita accounts for 28% of additional variance after the 
entry of NORTH and USA. An additional dollar of personal income per capita re- 
sults in about 20 additional cents of per pupil expenditure. In the case of the com- 
parison between New Brunswick and Nova Scotia, the only variables that differ are 
the personal income and centralization variables. Using the regression equation, 
Nova Scotia’s higher per capita personal income produces a per pupil expenditure 
difference of $128. The actual difference is only $75. The difference in per pupil ex- 
penditure between the two provinces is more than explained by the difference in 
wealth. This contradicts Lake’s (1983) hypothesis described earlier. 
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Table 1 


Descriptive Statistics and Correlation Coefficients for Annual 
Per Pupil Expenditure and all Independent Variables for 
62 American and Canadian Jurisdictions 


Descriptive Statistics 


Variable Standard 
Name Mean Deviation Description of Variable 
PPEX 1958.954 514.317 Annual per pupil expenditure 
NORTH 0.032 0.178 Northern state or territories 
Alaska and Can. territories = 1.0; Others =0.0 
USA 0.823 0.385 Federal jurisdiction 
USA=1.0; Canada=0.0 
PINC 8934.050 1474.880 Personal income per capita 
LOPR 0.399 0.188 Local proportion of revenue 
STPR 0.506 0.195 State proportion of revenue 
FDPR 0.095 0.049 Federal proportion of revenue 
Correlation Coefficients 
PPEX NORTH USA PINC LOPR STPR 
NORTH 0.712 
USA —0.151 — 0.154 
PINC 0.611 0.236 0.394 
LOPR 0.026 — 0.278 0.421 0.431 
STPR 0.053 0.304 — 0.552 — 0.407 — 0.968 
FDPR = 0312 —0.143 0.579 — 0.036 0.010 — 0.259 
|r| >0.211 is significant at the 0.05 level. 
|r| >0.296 is significant at the 0.01 level. 
Table 2 
Regression Results for Annual Per Pupil Expenditures as 
Predicted by Local, State, and Federal Proportions 
of Revenue for 62 American and Canadian Jurisdictions 
Regression Results with Local Proportion of Revenue 
Multiple R 0.890 R squared 0.793 
F value 54.495 Significance of F 0.0000 
Standard error of estimate 242.239 
Variable B Standard Beta Significance 
Name Coefficient Error Coefficient of Coefficient 
NORTH 1563.784 201.715 0.5416 0.0000 
USA — 432.291 93.297 — 0.3237 0.0000 
PINC 0.204 0.027 0.5838 0.0000 
LOPR 168.964 207.731 0.0617 0.4194 
(CONSTANT) 378.022 198.526 0.0619 
Dependent variable is PPEX. 
Regression Results with State and Federal Proportions of Revenue 
Multiple R 0.891 R Squared 0.794 
F value 43.168 Significance of F 0.0000 
Standard error of estimate 243.635 
Variable B Standard Beta Significance 
Name Coefficient Error Coefficient of Coefficient 
NORTH 1559.378 203.014 0.5401 0.0000 
USA — 383.458 125.048 — 0.2872 0.0033 
PINC 0.200 0.028 0.572L 0.0000 
STPR — 140.153 214.543 —0.0530 0.5163 
FDPR —650.642 841.656 —0.0615 0.4427 
(CONSTANT) 573.269 316.017 0.0750 


Dependent variable is PPEX. 
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Inequality of Expenditure Results 


Table 3 shows somewhat weaker first order relationships for the coefficient of 
variation analysis than were found for the level of expenditure analysis. Personal 
income per capita correlates positively with the coefficient of variation. While this 
relationship is statistically significant, it is not strong. None of the three centraliza- 


tion variables shows significant relationships with the coefficient of variation. 


Table 3 


Descriptive Statistics and Correlation Coefficients for 
Coefficient of Variation of per Pupil Expenditure and 
all Independent Variables for 49American Jurisdictions 


Descriptive Statistics 


Variable Standard 
Name Mean Deviation Description of Variable 
CV 0.195 0.096 Coefficient of variation of annual 
per pupil expenditure 
NORTH 0.020 0.1438 Northern State; Alaska=1.0; others=0.0 
PINC 9118.900 1291.027 Personal income per capita 
LOPR 0.437 0.155 Local proportion of revenue 
STEER 0.458 0.137 State proportion of revenue 
FDPR 0.105 0.039 Federal proportion of revenue 
Correlation of Coefficients 

CV NORTH PINC LOPR STPR 
NORTH 0.535 
PINC 0.248 0.427 
LOPR 0.116 — 0.236 236 
STPR — 0.099 0.281 — 0.082 00) 
FDPR UL LS —0.045 —().647 —0.570 0.361 


|r| >0.238 is significant at the 0.05 level. 
lr| >0.332 is significant at the 0.01 level. 


The coefficient of variation of per pupil expenditure proved to be much more 
difficult to predict with the variables used here than was the level of expenditure. 
Although both the regressions reported in Table 4 are significant, less than 36% of 
the variance is predicted in each case. NORTH shows a strong positive relationship 
to the coefficient of variation, but the relationship between personal income per 
capita and the coefficient of variation is small and insignificant. 

The regression coefficients on the centralization variables suggest that greater 
centralization is associated with greater equality but the effect is small. Local 
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Table 4 
Regression Results for Coefficient of Variation of 
Annual per Pupil Expenditure as Predicted by 
Local, State, and Federal Proportions of 
Revenue for 49 American Jurisdictions 


Regression Results with Local Proportions of Revenue 


Multiple R 0.596 R squared 0.355 
F value 8.251 Significance of F 0.0002 
Standard error of estimate 0.079 

Variable B Standard Beta Significance 
Name Coefficient Error Coefficient of Coefficient 
NORTH 0.429 0.098 0.6400 0.0001 
PINC — 5.386E-06 1.383E-05 =—(0.0727 0.6988 
STPR =().189 0.096 —0.2704 0.0550 
FDPR — 0.094 0.436 — 0.0383 0.8311 
(CONSTANT) 0.332 0.160 0.0439 


Dependent variable is CV. 


Regression Results with State and Federal Proportions 


of Revenue 
Multiple R 0.596 R Squared 0.355 
F value 6.067 Significance of F 0.0006 
Standard error of estimate 0.080 
Variable B Standard Beta Significance 
Name Coefficient Error Coefficient of Coefficient 
NORTH 0.429 0.098 0.6400 0.0001 
PINC —5.386E-06 1.383E-05 —0.0727 0.6988 
STPR —0.189 0.096 — 0.2704 0.0550 
FDPR —0.094 0.436 — 0.0383 0.8311 
(CONSTANT) 0.332 0.160 0.0439 


Dependent variable is CV. 


proportion of revenue has a positive relationship with the coefficient of variation 
which reaches significance at the 0.05 level. The state proportion of revenue has a 
negative coefficient which just fails to reach significance at that level. The federal 
proportion is unrelated to the coefficient of variation. 


The magnitude of the centralization effect on the coefficient of variation is not 
great. A reduction of 10 percentage points in the local share reduces the coefficient 
of variation by 0.018 which is less than one-fifth of a standard deviation. If Oregon, 
the jurisdiction with the second highest coefficient of variation, reduced its local 
proportion of 0.57 to zero this would reduce its coefficient of variation from 0.49 to 
0.39. This would only change its coefficient of variation rank from second to fourth. 


The two control variables account for 29% of the variance in the coefficient of 
variation while the centralization variables contribute an additional seven percent. 
Although this seven percent is statistically significant, centralization of revenue 
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collection cannot be recommended as a policy intended to reduce expenditure in- 
equality among school districts when 93% of the variance will be unaffected by 
whatever policy changes are made. An inspection of individual cases confirms that 
centralized revenue collection is neither a necessary nor a sufficient condition for 
equality of per pupil expenditure. 


Conclusions 


The empirical results do not support either the spillover model or the personal 
service model of education. Centralization of revenue collection does not consist- 
ently increase or decrease per pupil expenditure, but those jurisdictions with lower 
per capita personal income have chosen in disproportionate numbers to centralize 
their education finance systems. This may have given the impression that centraliz- 
ation leads to lower expenditure. Per pupil expenditure is well predicted by annual 
per capita personal income, but after controlling for this variable, the centralization 
variables show no relationship with per pupil expenditure. Any government policy 
which is intended to increase the adequacy of education finance can ignore this 
variable altogether. 


These results combine with those of Hadley (1985) to cast doubt on the useful- 
ness of the spillover model in explaining the finance and supply of education. The 
success of this model in explaining the provision of relatively pure public goods has 
not been duplicated for education. There are several possible reasons. To start 
with, there are the substantial private benefits of education exemplified by the 
higher incomes of educated persons and their generally better working conditions. 
Another possible factor is the embodiment, in the educated individual, of the in- 
vestment in his or her education. Traditional public goods such as defence are not 
embodied in individuals in this way. 

The importance of per capita personal income as a predictor of per pupil ex- 
penditure has been confirmed by this study. Each additional dollar of personal in- 
come resulted in an additional 20 cents of per pupil expenditure. 

Centralization of revenue collection leads to more equality of per pupil expendi- 
ture among school districts but the magnitude of this effect is not impressive. The 
centralization variables account for only seven percent of the variance of the coeffi- 
cient of variation. For this reason, centralized revenue collection by itself cannot be 
recommended to political decision-makers as a means of achieving greater equality. 

Additional research needs to be done to account for the 65% of the variance in 
the inequality of per pupil expenditure which is not accounted for in the research 
reported here. Equality of opportunity continues to be an important policy vari- 
able, but the research results currently available on the determinants of expendi- 
ture inequality are not very helpful to policy-makers. 
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An Examination of Private School 
Issues in Alberta 


When private schools benefit from public funding, any growth in the private educa- 
tion sector attracts particular attention during a time of enrolment decline in the 
public schools and of economic restraint. An examination of the opinions of school su- 
perintendents, board chairmen, and private school principals indicates some diver- 
gence of views with respect to issues such as the historical relevance of private 
schools, freedom of choice among alternatives, public control of private schools, and 
the commitment of parents, teachers, and students to their respective schools. Con- 
siderable consenses was found regarding such issues as freedom of choice and cause 
for a critical assessment of public education in the face of growing interest in private 
schools. 


The purpose of this study was to ascertain and compare the opinions of super- 
intendents and school board chairmen of public and separate school districts, and 
of principals of private schools with respect to current private school issues in Al- 
berta. An earlier study examined the private school movement in Alberta (Bergen, 
1982). Several findings from that study are reexamined here. That study also pro- 
vided a review of selected writing and research on private schools, including four 
decades of writing on private education in the province of Alberta. 


Recent studies regarding private education in Alberta are those of Digout 
(1969), Sloan (1980), Treleavan (1981), Baergen (1982), and Larsen (1984). Digout 
outlined the events which led to the introduction of public funding for private 
schools. Sloan traced the events and legislation leading to the provision of 
“umbrella” agreements by which a private school could operate largely as an alter- 
native school under a public school board. By means of a delphi study, Treleavan 
consolidated the opinions of 39 “knowledgeables” representing private school asso- 
ciations, teacher and school trustee associations, school superintendents, Depart- 
ment of Education officials, and politicians. Private schools were not perceived to 
be a threat to public education, however public financial support was seen as lead- 
ing to some political conflict. Also, the government policy was perceived to be 
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favoring developments in private education. This latter perception appears to be 
made explicit in various statements and documents current in 1985. 


Baergen (1982) examined secondary private school policies and manuals. He 
found some of these to contain exclusive admission policies. He questioned the re- 
stricted academic programs offered by most of the schools, and suggested that pri- 
vate schools should receive reduced grants according to the extent that they differ 
from public schools. Larsen’s (1984) study was undertaken for a county school com- 
mittee which wanted a survey of opinions regarding its public schools with the in- 
tent of meeting the needs and concerns of its resident school population in such a 
manner that interest in private schools would decline. Apparently the committee’s 
effort proved to be successful. 


Context 


The first schools in colonial Canada were private and most of these were estab- 
lished by Catholic and Protestant denominations. As public school systems evolved, 
a number of private schools continued to exist. At the same time, denominational- 
ism remained a characteristic of provincial school systems. Religious exercises for 
schools were prescribed in all provinces, and religious instruction was permitted ac- 
cording to local choice in all but British Columbia. 


One province (Quebec) had two denominational school systems, and another 
(Newfoundland) several. These constituted the provincially supported “public” 
schools. In three provinces (Ontario, Alberta, and Saskatchewan) provision was 
made for “separate” schools as part of the public school system. These separate 
schools could be organized by the minority denominational group within a school 
district. According to the Canadian constitution of 1867, these were defined as 
either “Catholic” or “Protestant.” Generally, non-Catholic schools have been re- 
ferred to as public schools and the Catholic schools as separate schools. Both 
categories are distinct from private or independent schools. The remaining five 
provinces have “nonsectarian” schools only. The public schools, at one time pre- 
dominantly of Protestant hue (Patterson, 1970; Stamp, 1970), over time became 
secularized. Some parents who might have found the public schools of fifty years 
ago quite acceptable, today find their secularization excessive. This, in part, ac- 
counts for an increased interest in denominational private schools, a trend which 
currently is causing some concern among public school supporters. 


Concern about private schools has evoked considerable debate in the province 
of Alberta. The Minister of Education engaged the Canadian Gallup Poll to survey 
public attitudes and opinions about education generally, including those about pri- 
vate schools. The opinion survey was published in May 1984. The Minister also 
appointed a Committee on Tolerance and Understanding to examine the instruc- 
tional content of public and private schools and a private consulting firm to make 
an extensive survey of private schools in the province. Both reports were made 
public early in 1985. Findings of these reports which are related to this study are 
discussed later. Also, the Minister’s Policy Advisory Committee for the School Act 
Review (1985) received briefs in response to its “principles” which included provi- 
sions for private schooling. 

In Canada, private schools are not a national concern except insofar as a part of 
the fees paid to private schools are submitted for income tax deductions. New- 
foundland and Prince Edward Island currently have no operating private schools, 
and very few exist in Nova Scotia and New Brunswick. Concerns are raised in 
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provinces where partial funding of private schools has been introduced, namely the 
four western provinces and Quebec. The Davis proposal of extending funding to 
senior high school grades for Ontario Catholic Separate Schools, grades which had 
been offered on the basis of private school fees, provoked renewed discussion re- 
garding public funds for private schools generally. Subsequently a commission on 
private schools was appointed and has since completed its recommendations 
(Shapiro, 1985). To date, a limited number of studies on private schools, generally 
restricted to the individual provinces, are available. The study reported herein adds 
to the examination of concerns within one province. This provincial focus is not un- 
usual. Structures, laws, and funding formulas differ sufficiently from province to 
province to make interprovincial comparisons very difficult. However, an accumu- 
lation of provincial studies may precede an interprovincial study of private schools 
still to be undertaken. 


Method 


A first questionnaire was mailed to all public school superintendents in Novem- 
ber 1983 in order to determine the perceived extent of private school developments 
in the jurisdictions of the respondents. The distribution and return of question- 
naires is shown in Table 1. This questionnaire was not mailed to separate school 
superintendents because private schools have to date not caused any noticeable 
drain of students who are residents of separate school districts. There are, however, 
some Catholic parents who have enrolled some of their children in non-Catholic 
private schools. Also, since 1982 one private Catholic school has been operating in 
Edmonton. Its students are residents mainly of the Edmonton Separate School 
District. As separate school districts are geographically coterminous with public 
school districts, separate school superintendent returns would have duplicated 
much of the same information. 


Table 1 


Distribution and Return of Questionnaires to 
Public School Superintendents 


Category Delivered Returned % Return 
City Districts 9 8 88.9 
Town Districts 9 100.0 
Village & Rural Districts iil 4 36.4* 
Divisions & Counties 60 49 81.7 
Total 89 70 78.7 


* A probability exists that some small school districts were being served by superintendents of other or larger 
school systems, and that each respondent completed only one return covering all jurisdictions under his di- 
rection. 


A second questionnaire was mailed to a sample of both public and separate 
school superintendents, to public and separate school board chairmen, and also to a 
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sample of private school principals. The respondent sample comprised half of the 
superintendents of the province, the school board chairmen of the same constituen- 
cies, and principals from half of the private schools. The latter comprised a select- 
ed sample in order to assure representation from various categories of private 
schools. Superintendents and board chairmen were selected in such a manner that 
approximately every other school board jurisdiction across the width and length of 
the province would be represented. In all, 169 questionnaires were mailed, and 74 
were returned, as detailed in Table 2. All questionnaires were mailed to the re- 
spondent groups towards the end of November 1983. Most returns were received 
by the end of January 1984. 


Respondents were asked first whether they agreed or disagreed with each of a 
set of 15 statements. Next, they were asked to provide written reasons for their 
responses. The first set of responses were tabulated according to the type of school 
jurisdiction. Each written response was assessed by a second researcher in order to 
arrive at agreement with respect to the appropriateness of its categorization and 
tabulation. 


Table 2 


Distribution and Return of Questionnaires 


Category ; Delivered Returned % Returned 
Public School Superintendents 40 20 50.0 
Separate School Superintendents 24 13 54.2 
Public School Chairmen 40 15) Seis 
Separate School Chairmen 24 7 29.2 
Private School Principals 41 i 46.3 
Total 169 74 43.8 
Findings 


Perceived Extent of Private School Development in Public School Jurisdictions 


The initial part of the first questionnaire to public school superintendents con- 
cerned the extent of private school developments in the jurisdictions of the re- 
spondents. Table 3 summarizes these responses. Twenty-three of the 70 responding 
superintendents indicated that no private schools were located in their 
jurisdictions. Nineteen reported the existence of category one private schools (those 
employing certificated teachers and receiving government grants), 15 reported cate- 
gory four private schools (probably not employing certificated instructors), and 18 
the existence of unauthorized operating private schools. As category two and three 
private schools (special schools for the handicapped and second language schools 
for the instruction of a second language during hours outside of the regular school 
day) do not cause a reduction in regular school enrolment, no enquiry was made re- 
garding such schools. 
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Table 3 
Extent of Private School Developments as 
Reported by Responding Public School 


Superintendents 
ee eee ee ee ee eS eee a eee 8 ea ee ae 9 eee 
Item School Jurisdiction Subcategory Totals 
Urban Nonurban/Rural 
City/Town Private Private 
Schools Schools 
located not located 
in district within 
district 
No. of returns Pall 26 23 70 
No. reporting cat. 1 private schools 6 ils 0 19 
No. reporting cat. 4 private schools 3 12 0 15 
No. reporting unauthorized schools 3 15 0 18 
No. reporting private schools (any cat.) 9 26 0 3D 
Total No. of cat. 1 schools reported by all ey 20* 0 Baye 
respondents 
Total. No. of cat. 4 schools reported by De 18* 0 23* 
all respondents 
Total No. of unauthorized schools re- 6* ike 0 Dake 
ported by all respondents 
Total No. of private schools reported 26* 55* 0 Sie 
(any category) 
No. reporting loss of students to private 14 26 10 50 
schools 
Range in No. of grades 1-9 pupils lost by 10-500** 6-470** D=2 06a 
public school systems 
Range in the No. of grades 10-12 stu- 7-200** 2-250** et ee 
dents lost by school systems 
No. reporting formal and/or informal ar- 2 2 0 4 


rangements with private schools 


* These numbers may be inaccurate; superintendents may not be aware of the actual status of every private 
school located within their jurisdictions, nor may they have an accurate count of the total number of oper- 
ating private schools. 

** Kstimates 

*** Pupils attending private schools outside of the reporting superintendents’ jurisdictions. 


The returns suggest that about one-third of all private schools are located in 
urban centers. Fifty respondents reported a loss of students to the private schools. 
For some the numbers were quite small; however, in some jurisdictions estimates 
ranged as high as 500. It is also apparent that the summary may not include 
Edmonton, where more than 1,000 students attend private schools. (Alberta Col- 
lege alone enrolls about that many, though most of these might be adult students 
who would not be attending a regular high school.) At least 12 superintendents 
from larger school systems did not respond. A total response might have reported 
more than 100 operating private schools even though all superintendents may not 
be aware of the precise number of such schools in their jurisdictions. 
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Four superintendents reported arrangements with private schools. These in- 
cluded the following: private school students having access to the music program in 
a junior high school; busing of students to private schools; loan of some instruc- 
tional materials; tuition agreements; and a cordial relationship in that invitations 
to school programs are accepted. Sixty-three of the 70 respondents reported that 
no relationship of any kind was observed with respect to private schools. 


Superintendents responded to questions concerning the impact of private 
schools as follows: | 
1. The nature of the impact of private schools upon the programs of the public 
schools. 


Sixty-four of the 70 superintendents responded to this item. Little or no impact 
was reported by 33 of the superintendents. A loss of grant revenue and/or curtail- 
ment of school programs was reported by 13. The most serious consequences were 
suffered by small districts. 

2. Foreseen future private school developments in jurisdictions. 


Fifty-nine superintendents responded to this item. Of these, 28 reported no 
foreseen developments. However, further developments were anticipated in half of 
the jurisdictions in which private schools had been established, and interest in pri- 
vate schools was anticipated in five of the 23 jurisdictions which did not contain 
private schools. 


Respondent Views Regarding Statements About Private Schools 


A set of 15 statements were formulated, each of which appeared as a reasonable 
assumption based on an earlier study on the development of private education in 
Alberta (Bergen, 1982). Table 4 details for each respondent group the percent of 
respondents agreeing with each statement. Selected comments are quoted in order 
to portray the diversity of views. Any one comment need not, but in several 
instances does, represent the majority view of the group. 

Comments are identified according to the category of the respondents as fol- 
lows: 

PS—public school superintendent 
SS—separate school superintendent 
PC—public school board chairman 
SC—separate school board chairman 
PP—private school principal 
1. Historical roots have relevance in current decisions about the place of private 
and alternate schools in the context of a provincial education system. 

Most respondents agreed with this statement, the lowest proportion (68.2%) 
being for board chairmen. Numerous comments (18), in greater proportion by sepa- 
rate school superintendents and private school principals, were made with refer- 
ence to human rights and religious convictions. Public school superintendents and 
board chairmen generally made comments (16) to the effect that present conditions 
should take precedence over historical factors. 

Woods Gordon (1984) found “no reason to disallow private schools from con- 
tinuing to operate in Alberta” (p. 17). 

Selected comments 


What is important is the present situation—the rights of individuals and/or groups. (SS) 
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Percent of Respondents Agreeing With Statements 
Relating to Private Schools 


es 5 ee SS ae a re ae ee eae ee pn, Se ee ee i ee, 


Statement 
1. Historical relevance 
2. Freedom of choice 
3. Useful function 
4. Grants to private schools 
5. Constitutional precedence 
6. Cause for assessment of public 
education 
7. More provincial control 
8. Need for advance notice to dis- 
trict boards 
9. Need for undiminished grants 
to district boards 
10. Private schools be responsible 
to district boards 
11. Open admission policy for pri- 
vate schools 
12. Uncertificated teachers accept- 
able in private schools 
13. Commitment of public relative 
to private school teachers 
14. Commitment of public relative 
to private school parents 
15. Commitment of public relative 


to private school students 


Public 
School 
Supts. 
N=20 


85.0 
70.0 
65.0 
40.0 
39.0 
75.0 


85.0 
95.0 


85.0 
60.0 
100.0 
30.0 
60.0 
30.0 


50.0 


Respondent Group 
Separate School 
School District 
Supts Chairmen 
N=13 N=22 
84.6 68.2 
923 86.4 
100.0 (24 
69.2 54.5 
76.9 59.1 
92.3 95.5 
100.0 95.5 
wa 100.0 
76.9 124 
61.5 63.6 
61.5 90.9 
46.2 9.1 
53.8 68.2 
46.2 50.0 
61.5 50.0 


Private 
School 
Principals 
N=19 


94.7 
94.7 
100.0 
89.5 
89.5 
89.5 


42.1 
47.4 


15.8 


Catholic Separate School Systems and/or Protestant Separate School Systems only, have 
historical roots that should be considered in any “dual” country, for example, Bilingual 
and Bicultural. (PC) 
Indeed historical roots have relevance. Too many of us have failed to understand or ap- 
preciate the history of private schools which predate our province! (SC) 
Private schools are not a new phenomenon in Alberta or elsewhere in the world for that 
matter. Throughout recent history Christian schools have been educational leaders, only 
later to be replaced by state education. (PP) 
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2. The freedom of parent and/or student choice—particularly in the context of 
the Canadian Charter of Rights and Freedoms—has relevancy in private and 
public education. 

Most respondents agreed with this statement, with the lowest proportion (70% ) 
being for public school superintendents. Comments (18) supported freedom of 
choice as an accepted principle. Statements of concern (15) relating to standards 
and the protection of the student were made in greater number by public school 
superintendents. 

According to the Gallup Poll (1984), 93% of all respondents supported the right 
of parental choice. Woods Gordon (1984) also found general support of the right of 
parents to choose the school. The Minister’s Advisory Committee (1985), in enunci- 
ating principles for a new school act, acknowledged that schooling may occur 
“outside of a conventional school,” and that parents may be choosing “other than a 
public or separate school” (p. 11). 

Selected comments 

Freedom of choice is an accepted principle of our democratic society. (PS) 

None of the rights which were entrenched in the B.N.A. Act of 1867 regarding private and 
separate schools have been changed in any way by the Canadian Charter of Rights. (SS) 
Children are not always able to choose what is best for them in the long-term. Many par- 
ents think in terms of their needs and do not understand the needs children will have to 
prepare themselves for the society they will be part of. (PC) 

They may have to await Court decisions of this matter. But my visceral feeling is that the 
freedom of the parent has relevancy within the context of the Canadian Charter of Rights 
and Freedoms. (SC) 

3. Private schools do represent a useful function in providing for specific needs 
expressed by individuals and groups. 


All separate school superintendents and private school principals agreed with 
the statement. The lowest proportion of agreement (65%) was found for public 
school superintendents. The comments (15) of public school superintendents and 
board chairmen suggested that sufficient provision for all needs can be provided by 
the public schools. Superintendents, public and separate, also commented (9) on 
the possible danger of excessive parochialism on the part of private schools. 


Though only three percent of the Gallup Poll respondents had children in pri- 
vate schools, 72% of all respondents indicated that they would recommend the pri- 
vate school as an alternative. 

Selected comments 
All parents favoring private schools should remain mindful of the rest of society and the 
different backgrounds of others—the danger with private schools is that they could get 
very narrow in scope. (SS) 
They meet the specific needs of the group that cannot get satisfaction within the public 
system. (PC) 
So long as public/separate boards respond slowly or do not respond at all to special 
groups, then private schools will continue to flourish. (SC) 
Although the public system is doing a good job for the majority of students it is beneficial 
to maintain private schools to allow alternatives for parents and/or students who are 
seeking a type of education or an orientation in education which they feel is not found in 
the public system. (PP) 
4. Since private school supporters, like public school supporters, through taxa- 
tion support public education, this has a bearing on whether or not public 
grants might be allocated to private schools. 
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Only 40% of public school superintendents, in contrast to 69% of separate 
school superintendents, were in agreement with this statement. A closer examina- 
tion revealed that rural superintendents tended to be less in agreement with the 
statement than were urban superintendents, and public school chairmen less than 
separate school chairmen. With the exception of public school superintendents, the 
other respondent groups volunteered comments (25) to the effect that since private 
school supporters paid public school taxes, they deserved some entitlement. 


According to the Gallup Poll (1984), 77% of the respondents agreed that par- 
ents ought to be able to direct tax dollars to the school of their choice, and 57% fa- 
vored provincial funding of private schools. Woods Gordon (1984) found general 
disagreement among public school groups towards the proposition of parental right 
to direct taxes to a school of their choice, and opposition to support of private 
schools through the regular means of taxation available to public schools. Woods 
Gordon also. found that the majority of public school respondents thought current 
public funding of private schools to be adequate. However, in his recommendations 
to the Minister, Woods Gordon favored some additional support for private 
schools, and said that according to an appropriate formula, private schools which 
meet provincial standards be accorded the same types of provincial grants (with 
the exception of capital grants for facilities) which are available to public schools. 

Selected comments 

Everyone benefits from the general education of a community like they do from roads and 
so should pay to provide them. If a person wants a private road or private education for 
his child that is a totally separate matter. (PS) 

I have no objection to parents’ educational tax dollar following the child to his/her private 
school. (SS) 

I am somewhat of a believer that the “dollars” of support should follow the child. (PC) 
Parents of students in private schools are also part of the public and so their children 
should qualify for public grants. Also, private schools do perform a public good by educat- 
ing children. (PP) 

5. The existence of a constitutionally-based choice for Catholics is an argument 
for providing a similar, even though limited, choice to other denominations 
which would be different from that of being locked into a secular or nonsec- 
tarian school system. 

Most private school principals (89.5%) were in agreement, whereas a minority 
of public school superintendents (35.0%) agreed. It is understandable that a 
greater proportion of Catholic separate school superintendents (76.9%) would be 
inclined to agree. A closer examination indicates that a greater proportion of sepa- 
rate board chairmen than public chairmen tended to agree with the statement. A 
number of statements (23) suggested that other groups should be accorded privi- 
leges similar to those enjoyed by Catholics. Other statements (10), most of which 
were proffered by public school superintendents, warned that some practical limits 
must be set in order to prevent extreme fragmentation of the school system. 


Selected comments 

It is my belief that the formation of a Catholic school system has not contributed in a pos- 
itive way to the advancement of education or democratic principles. To create further 
fragmentation would compound the problem. (PS) 

I am a superintendent of an R.C. Separate District. The arguments I use to justify sepa- 
rate schools are identical to those used by denominational private schools. (SS) 
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History does not so favor other religious groups in Alberta. But Catholics are a minority, 
thus I often ask why should one minority be so favored? (SC) 
If we accept that Canada is a multicultural nation, then we must also accept that it is a 
multireligious nation. If we accept that human rights are a fact, then we cannot afford to 
overlook the above in providing these in our Charter and school systems. (PP) 

6. Any perceived “flight to private schools” is cause for a more critical assess- 
ment of current public education. - 


There was overwhelming agreement with the above statement. The lowest pro- 
portion (75%) was found among public school superintendents. Among the latter, 
the rural superintendents tended to be more in agreement with the statement than 
the urban. Most statements (25) suggested the need for a study of inherent prob- 
lems and an evaluation of public schools. 


The Gallup Poll found that 74% of respondents would want religion in schools 
either as a core subject (12%) or as an option (62%). Eighty percent would want 
moral or ethics education as a core subject (30%) or as an option (50% ). The Cana- 
dian Education Poll (August, 1984) related that 18% of polled Canadians reported 
little confidence in their public school, and that 48% said that they had “too little 
say” in how schools are run. The Minister’s Advisory Committee (January 1985) 
recommended that schools develop policies “which reflect the value of high 
standards of interpersonal conduct,” and “that foster the development of moral and 
ethical standards” (p. 7). 
Selected comments 
We cannot be blind to the inequalities and inadequacies of some of our schools... . 
Evaluation of systems, teachers, students must be ongoing. (PC) 
Trustees should take up the mantle and assess what is happening in their own 
jurisdictions. (SC) 

7. More general control and evaluation of private schools should be exercised by 
the provincial Department of Education. 


Only 42.1% of private school principals agreed with this statement which re- 
ceived overwhelming support from respondents in each of the other categories. 
About 41 volunteered comments suggesting that private schools should undergo 
about the same degree of control as public schools. Half the comments of private 
school principals claimed that the amount of current control is adequate. 


The Gallup Poll found that 66% of the respondents thought that private 
schools need to be authorized by the government; 90% that their teachers meet 
provincial standards for certification; 85% that their buildings, and 84% that their 
curriculum meet provincial standards. 


Most respondents in the Woods Gordon (1984) study thought that current reg- 
ulations, monitoring and evaluation procedures to be adequate. The Province cur- 
rently possesses all necessary legal powers to regulate and monitor private schools. 
Any restrictions would be such as might arise from an interpretation of the Cana- 
dian Charter of Rights and Freedoms and the Alberta Bill of Rights. Such testing 
of the performance of students in private schools as compared to students in public 
schools, which was done at the behest of Woods Gordon in its Minister-mandated 
study, did not produce results which allow clear and indisputable conclusions or 
cause for alarm. A major concern was expressed in relation to insufficient Canadian 
content of curriculum materials imported from the USA. 
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Selected comments 


Although parents may choose the private school route, their children are entitled to the 
best quality possible. Alberta Education has the responsibility of monitoring all educa- 
tional institutions to ensure that children are not shortchanged. (SS) 


The Department of Education should be able to satisfy itself that a private school is pro- 
viding a good quality education for its students. (PP) 

8. A public school board should be given advance notice and an opportunity to 
plan program adjustments before a private school is permitted to open its at- 
tendance area. 


Superintendents and board chairmen were in substantial agreement, and about 
half (47.4% ) of the principals also agreed that such notice ought to be given. About 
38 comments reinforced the position that school districts need advance notice in 
order to adjust plans when they face the probability of a decreased enrolment. 
Selected comments 
Yes, primarily to explore more fully the avenues of making the role of “public” school bet- 
ter understood and also to plan reasonable adjustments. (PS) 
School boards should also be given the opportunity of responding to the special educa- 
tional needs and providing the special services which will be met by the private school. It 
may be that a board has not been aware of the specific needs of its constituents. (SS) 
If this were to happen, with concerned parents honestly discussing their concerns, there 
could be fewer private schools. (PC) 
I believe it would be good P.R. for private schools, before they get established, to give ad- 
vance notice to the public school board about establishing a school. .. . We established 
our school for religious reasons (faith commitment), not necessarily because we felt the 
public schools were doing a “bad job.” (PP) 

9. A public school board should receive undiminished provincial grants during a 

period of adjustment when an enrolment decline is due to the opening of a 

private school. 


Unlike private schools, public school systems have a provincially mandated 
obligation to provide for every resident student. When public schools lose students, 
they also lose the per student provincial grant subsidy. Particularly during times of 
general enrolment decline, the loss of students adds to the burden of meeting 
school costs. Only 15.8% of private school principals responded at all to this item, 
and these were in agreement with the other respondent groups. About 35 com- 
ments emphasized the need of undiminished financial support during a period of 
adjustment. 

10. The incorporation of “private schools” as “alternate schools” under commit- 
tees responsible to a public board is a desirable course of action. 

Only 21.2% of the private school principals favored this alternative. Neither did 
the alternative receive overwhelming support from any of the other groups. Re- 
spondents from all groups offered comments (22) stating that such accommodation 
ought to be possible. About as many (21) warned that such an arrangement would 
be undesirable for both parties. Separate school chairmen tended to agree with the 
statement in greater proportion than did public school chairmen. 


Selected comments 

I think that this should be the first step prior to starting a private school. If the public 
system does not or cannot meet the needs of a particular group then it should be allowed 
to plan the operation of a private school. (PC) 

The experience of some schools which have elected this route in the past appears to be far 
less than desirable. Operating directly under the jurisdiction of the Department as a pri- 
vate school is generally preferable. (PP) 
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11. Private schools which receive public grants ought to be obligated to observe 
open admission policies similar to those of public schools. 
Though this position met with unanimous support on the part of public school 
superintendents, only 61.5% of the separate school superintendents favored the 
prospect. The latter are cognizant of some controls they can exercise with respect 
to which sector of the local populations shall have unrestricted access to their 
schools. It is interesting to note that a proportion as large as 42.1% of private 
school principals supported the open admission position. Comments included: that 
funding should be related to the nature of an open admission policy (11); that the 
adherence of admitted students to school policy is essential; and that an open ad- 
mission policy might defeat, in part, the purpose of private schooling (5). 
Selected comments 
All the children who meet the basic criteria which set the private school apart from the 
public school should be admitted. (SS) 
If the philosophy, goals, and objectives meet the parents’ needs and if the public grants 
are at par with the public schools, then an open admissions policy ought to be in effect. 
(BP) 

12. The option of operating a private school without certificated teachers when 
no public subsidy is being received should continue. 


For each group, support was registered by only a minority of respondents. 
Urban more than rural superintendents, and separate more than public board 
chairmen were inclined to agree with the option. Eighteen respondents indicated in 
their comments that with adequate supervision and evaluation such accommoda- 
tion should be possible, yet 23 others insisted that certification of teachers should 
be a requirement for private as well as for public schools. 


Selected comments 

The option of operating schools without certificated teachers when no public subsidy is 
being received should be allowed to continue. (SS) 

I think it best that all teachers be certificated. ... Their values or secular viewpoints may 
have to pass the scrutiny of a private school board. (PC) 

The option of operating a private school without certified teachers should continue, as 
long as it can be verified and demonstrated that the educational standards are at par with 
public schools. (PP) 

13. Public school teachers are as committed to their schools as are the teachers of 

private schools to their schools. 


Only 26.3% of private school principals agreed with the statement. However, 
the other three respondent groups did not show an overwhelming conviction re- 
garding the commitment of their teachers in relation to private school teachers. 
Most comments (18) indicated the belief that the commitment of teachers was 
about the same in private and public schools. Another ten felt that private school 
teachers need to be more committed as they enjoy less security of tenure and may 
be teaching for lower salary than public school teachers. Others (7) felt that com- 
mitment was a characteristic of the individual teacher, whether the teacher be in 
one type of school or in another. 


Selected comments 


Teacher commitment as a norm cannot be associated with one more than with the other. 
It is a matter of an individual’s professionalism. (SS) 


Most public school teachers are committed to their schools but I think all teachers of pri- 
vate schools are committed and extremely dedicated to their school. (PC) 
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Some public school teachers are very committed to their schools. The difference is that 
the public school system appears to be stuck with the uncommitted as well as the com- 
mitted. (PP) 

14. Parents of children in public schools are as committed to their schools as are 
parents of children in private schools to their schools. 


Private school principals tended to be less in agreement with this statement 
than respondents from other groups. However, only 50% of board chairmen, and 
less than half of the public school superintendents (30.0%) and of the separate 
school superintendents (46.2%), felt confident in claiming the same commitment 
for parents of children in their schools. Comments included that there was little 
difference (20); that the additional costs and involvement accounted for greater 
parental commitment for private schools (23); and, that parents take public schools 
for granted (5). 
Selected comments 
Private school parents may have to devote more of their resources towards their schools, 
but this should not be confused with commitment to their children’s education. (PS) 
Very probably—and with one difference! When a parent has made a choice and is paying 
fees, he/she is more likely to be involved in the school. If you are paying for something di- 
rectly then you are going to make sure that it works. (PP) 

15. Students of public schools are as committed to their schools as are students 
of private schools to their schools. 


Only 61.5% of separate school superintendents and 50.0% of public school su- 
perintendents and board chairmen agreed with the statement, and only 21.1% of 
private school principals thought such to be the case. Many respondents, represent- 
ing respondents from each group, felt that there was little difference in commit- 
ment on the part of students in public or private schools, but that any differences 
were those attributable to the individual student and the student’s home (25). 
Others (10) felt that private school students might be more committed as a result 
of a selection process available to private schools. 


Selected comments 

I suspect that students whose parents have to pay tuition fees are pressured by parents to 
be more committed to achieve than perhaps some students in the public school system. 
However this depends upon the parents. (PS) 

Students in private schools are more committed probably because they or their parents 
have chosen a school with which they can strongly identify. (PP) 


Summary and Discussion 


The findings arising from the responses to the first questionnaire to public 
school superintendents allow the following generalizations: 


il Private school growth was not restricted geographically, but has occurred gener- 
ally in all parts of the province; 
2 There has been an uneven loss of students to private schools among 


jurisdictions—the smaller districts suffered the impact of loss in enrolment more 
than did larger urban districts; 


3. A considerable growth of small unauthorized private schools had occurred by 
1984 (only two or three of these had neglected to obtain ministerial authorization 
by the summer of 1985); 

4, Only a few school systems under the school boards had established formal or in- 


formal relationships with private schools located within their district boundaries; 
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5. About half of the superintendent respondents anticipated further private school 
developments within their jurisdictions; 

6. Very few school districts operated alternative schools for special resident interest 
groups, though about half of the respondents indicated that they considered 
such arrangements preferable to the establishment of private schools. 


The high rate of return from superintendents may indicate a considerable level 
of interest, and perhaps also of concern, regarding private school growth. Private 
schools are located in about two-thirds of the jurisdictions under public school su- 
perintendents. Some of the schools enroll only a few pupils, some have become 
large institutions enrolling several hundred. 


Table 4 contains a compilation of responses to the second questionnaire. 
Table 5 provides arbitrary numerical weights applied to ranges in percent of re- 
spondents who agreed with each of the statements, where +3 indicates 85% or 
more of the respondents of a group agreeing, and -3 less than 15% agreeing (or 
85% or more disagreeing) with a statement. A weight of 0 has been given to a range 
of 45 to 54% of respondents of a group agreeing with a statement, that is, where 
about half of the respondents of a group agreed and about half disagreed with a 
statement. 

Wherever a respondent group differed substantially from the other three 
groups, such is indicated by an asterisk. ‘Thus it can be seen that for about half of 
the statements, the private school principals differed substantially from the other 
groups; public school superintendents in questioning the useful function of private 
schools; and the board chairmen in questioning the historical relevance of private 
schools. 

General agreement was found among respondent groups with respect to the rel- 
evance which the following factors have to issues relating to private schools: 


the historical context (statement #1); 


2 the freedom of choice (separate school superintendents more so than did the 
public) (#2); 

oF the needs of individuals and minority groups (separate school more so than pub- 
lic school superintendents) (#3); 

4, the state of public education (#6); 

5. the degree of provincial responsibility and control (#7); 

6. the impact of the establishment of private schools upon district school programs 
(#8 & #9); and, 

fe the requirement of certification for all teachers (#12). 


Less certainty of opinion between groups was expressed with respect to the fol- 
lowing: 


the justification of public funding for private schools (#4); 


zi, the relevance of the constitutional provision for Catholic separate schools to the 
preference of other denominations in schooling (#5); 

3. the incorporation of private schools as alternative schools under district boards 
(#10); 

4, an open admission requirement for private schools (#11); 

5 the requirement of uniform certification for all teachers to include those in pri- 


vate schools (#12); and, 
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Table 5 
Weighted Opinion of Respondent Groups 


Group 
Statement PS SS C PP Consensus 
1. Historical relevance 3 i 3. strong 
2. Freedom of choice criterion 3 3 3 strong 
3. Useful function of private schools he 3 2 3 general 
4. Grants to private schools justified =| 1 0 3* low 
5. Constitutional precedence of R.C. Separate mal 2 1 3 low 
6. Need to assess public schools 2 3 3 3. strong 
7. Need more provincial control of private schools 3 3 duamin veneral 
8. District boards require advance notice 3 3 3 O* general 
9. Boards require undiminished grants to cover loss of 3 2 2  —2* general 
pupil grants 
10. Private schools as alternative under boards i 1 1 -—2* general 
11. Open admission policy 3 i 3) la low 
12. Uncertificated teachers in private schools ae 0)! Gea ae Lee eeueral 
13. Commitment of public teachers vs. private 1 0 1° = 2*¢ general 
14. Commitment of public parents vs. private =] 0 QO —2 general 
15. Commitment of public students vs. private 0 1 QO —2 general 
Groups: 
PS-public sch. supts. Numerical Range in % of respondents 
SS-separate sch. supts. scale in agreement with statement 
C-board chairmen a 85-100 
PP-private sch. principals a 70-84 
1 55-69 
*Respondent group differs 0 45-54 
substantially from other | 30-44 
three groups =-2 15-29 
=< 0-14 
6. the commitment and involvement of teachers, parents and students in public as 


compared to private schools (#13, #14, & #15). (Private school principals 
claimed the commitment to be somewhat higher for private than for public 
schools. Public school superintendents tended to agree with private school 
principals that their parents were not as committed as those for private schools.) 


Obviously, smaller districts are more sensitive to the loss of grant revenues and 
curtailment of programs upon losing students to private schools than are the dis- 
tricts with larger student populations. Quite understandably, public school districts 
would like to be able to plan necessary changes prior to the loss of students to a 
private school, and therefore would like to receive adequate notice. At least some 
districts would want the opportunity of meeting with parents wishing to establish a 
private school in order to ascertain whether their particular needs could be met 
within the existing schools. Some districts would want to try to meet the needs of 
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minority groups through alternative programs within the schools, and a few would 
consider operating alternative schools in preference to seeing the establishment of 
private schools. However, many superintendents regretted that removal of students 
from the public schools was deemed to be necessary by some parents. Some 
acknowledged that an assessment of their school programs and activities is 
warranted when some residents give strong reasons for removing their children in 
order to enroll them in a private school of their choice. 


The Gallup Poll on Alberta Education indicated a generally liberal acceptance 
of alternatives in education such as that provided by private schools. The recom- 
mendations of the Woods Gordon (1984) consultants were supportive of such alter- 
natives. In this study, respondents representing Catholic separate schools, 
cognizant of a right enjoyed by one denominational group, were inclined to express 
a more liberal view towards the support of private schools. Furthermore, there was 
fairly strong consensus on the part of all respondent groups that the freedom of 
choice criterion has relevancy in both public and private education. 


Major issues, rather, concern more the quality of education offered to children 
than the question of whether public funding ought to be directed to private school- 
ing. These issues were also raised at a conference on education finance in Alberta 
by Atherton (1985) in the consideration of concepts of equity. Whereas private 
schools serve the needs of a selective clientele, the needs of the disadvantaged mi- 
norities may need to be met largely by the public schools. However, in fairness it 
must be said that the needs of some of the severely handicapped were being served 
(and some still are) by private schools established for that purpose before an 
enlightened public conscience included their needs within the public domain. 


Conclusion 


During 1985 about 97% of all Alberta elementary and secondary pupils were 
enrolled in schools under elected district boards. That suggests that any impact of 
the growth of private school enrollment would be felt at the local district rather 
than at the provincial level. It would seem, then, that the impact which public 
funding of private schools has on public schools ought to be of concern to provin- 
cial education authorities who might be able to provide measures of support 
towards the alleviation of any disadvantages suffered by public school systems. The 
question, whether or not the establishment and operation of private schools is to be 
supported through public funding, is a matter of government education policy in 
part wrought through the political pressure of the private schools sector electorate. 
A governmental disposition to grant such support is not unrelated to prevailing 
societal and political values. For example, though a bitter public debate raged in 
Manitoba following the MacFarlane Commission’s 1959 recommendations regard- 
ing an introduction of public funding for private schools, a modest measure of such 
funding was introduced without much controversy about 20 years later. How- 
ever, by that time the other three western provinces and Quebec had introduced 
varying programs of public funding of private schools. Currently the controversy is 
centered in Ontario, where the Shapiro Commission has made recommendations 
which, if implemented, would alter considerably the status of private education in 
that province. 


As in the other provinces, since the beginning of this century, public education 
has enjoyed phenomenal growth and its services have become available to every 
child in Alberta. Perhaps in this decade decisions are being initiated which may 
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substantially alter the future provincial landscape in education. Private schools, at 
one time practically ignored or merely tolerated, have gained a political and educa- 
tional prominence out of all proportion to the number of students they serve. It ap- 
pears that for the first time in the education history of some provinces, including 
Alberta, public school systems as the sole providers of educational services to all 
clients is being put in question, and that a societal climate is emerging in which the 
partial privatization of educational services could become a reality. The concern of 
committed public educators must be whether, in the long run, the consequences of 
such an eventuality may yield a net loss to education. 
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Eliciting the Language Function of 
Forecasting/Reasoning in Elementary 
School Classrooms 


The purpose of this study was to explore possible methods which might be used with- 
in the elementary school classroom to elicit the language function of Forecasting/Rea- 
soning, and to examine the performance of children of various ages in their use of 
language for this function. Kindergarten, grade three, and grade six students engaged 
in a problem-solving activity under one of three treatment conditions: the activity 
alone, the activity including a language model, and the activity including the 
researcher’s use of a questioning strategy. Results indicated a significant difference 
between grade three and grade six at the .01 level of significance. Grade three stu- 
dents used significantly more language for Forecasting/Reasoning. Results also indi- 
cated that the questioning strategy was significantly more successful than either the 
activity alone or the language model in eliciting Forecasting/Reasoning. No signifi- 
cant interaction effect was found between grade level and treatment. 


Success in learning within the school environment is closely linked to skill in 
oral language (King, 1984; Shafer, Staab, & Smith, 1983). Those children who talk 
easily, make their questions clear, and get their meanings across to the teacher re- 
ceive more attention and can influence more successfully the flow of instruction to 
meet their needs (King, 1984). Central to the issue of talk and getting one’s mean- 
ing across is the concept of language function (i.e., the purpose for which one uses 
language). Adequate command of oral language includes the ability to use language 
for a variety of functions such as to inform, to request, or to reason within appro- 
priate contexts. Verbal interaction in classrooms requires functional competence 
(Wilkinson, 1985). The purpose of this study was to explore possible methods 
which might be used within the elementary school classroom to elicit one specific 
language function, that of Forecasting/Reasoning, and to examine the performance 
of children of various ages in the use of language for this function. 

Halliday (1973) has proposed the concept of language function as the seminal 
idea for language development, as evidenced in the following statement: “The 
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child’s awareness of language cannot be isolated from his awareness of language 
function, and this conceptual unity offers a useful vantage point from which lan- 
guage may be seen in a perspective that is educationally relevant” (p. 20). Halliday 
has advanced the notion that language function is central to a child’s language de- 
velopment and that the development of syntax and vocabulary, about which educa- 
tors appear to be so highly concerned, springs from a functional base. Children 
understand what language is because they understand what language does (Yalden, 
1981; Loban, 1979; Halliday, 1975). 


Coupled with the notion that children understand language because they under- 
stand its function, is the recent finding indicating that children need to use lan- 
guage for a variety of functions within the school setting (Pellegrini, 1984; 
Chapman, 1981; Hunter-Grudin, 1979; Halliday, 1978; Tough, 1973). Children’s 
command of a large repertoire of functional uses of language is directly related to 
their ability to communicate effectively in a number of different contexts and with 
many different kinds of people (Pellegrini, 1984). 


If children need to use language for a variety of functions within the school set- 
ting then it seems necessary to list and describe these language functions. Tough 
(1979, 1973) has listed seven specific language functions which her research indicat- 
ed are necessary for school success. Tough’s seven language functions are: 


1. Self-Maintaining (i.e., justifying and supporting the self, as in expressing needs 
and justifying behavior—“It’s my turn.”) 

2. Directing (i.e., requesting or demanding something—‘“Please hand me a pen- 
ei, 
Reporting (i.e., giving information—“That shape is square.”) 
Towards Logical Reasoning (i.e., drawing conclusions and stating cause and ef- 
fect relationships—“The bridge collapsed because the log was too short.”) 


5. Predicting (i.e., speculating about the future—“It’s going to rain tomorrow.”) 
Projecting (i.e., projecting into the feelings of others—“She is feeling un- 
happy.”) 

7. Imagining (i.e., any of the above said in a pretend context or when fanatasizing 
out.of a real-life situation—“I’m a queen.”) 


Even though these language functions have been deemed necessary for school 
success there has been little suggestion in the literature as to how children might 
be encouraged to use language for these functions or little exploration into specific 
factors of the classroom situation which tend to elicit these functions. As language, 
including language functions, is always context specific (Garvey, 1984; Shuy, 1984; 
Wells, 1981), one answer lies in developing contexts within the classroom for which 
particular functions of language are appropriate. 


In a previous study, the researcher developed several appropriate contexts by 
designing a series of activities which were designed to elicit each of Tough’s (1979, 
1973) seven functions of language (e.g., a specific activity to elicit Self-Maintaining, 
a specific activity to elicit Directing, etc.). Results of this study matching activity 
with function indicated that it was possible to design an activity which elicited a 
significant amount of language used for a specific language function (Staab, 1983). 

Central to each activity designed to elicit a language function was what was 
termed the essence of that activity (Staab, 1983). The essence of an activity was 
that key factor within the activity which was actually related to the elicitation of 
the desired language function and which could generalize to other similar activities 
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(i.e., those activities eliciting the same language function). For example, the essence 
of the activity designed to elicit the language function of Towards Logical Reason- 
ing was solving a problem, yet the actual activity might be a science experiment to 
determine which of a number of objects would float. Shafer, Staab, and Smith 
(1983) conducted a two-year field study in a metropolitan area of the southwestern 
United States and determined that various activities which included the same es- 
sence did elicit similar language functions. By using the essence, countless activities 
of varying degrees of difficulty could be designed to elicit the same language func- 
tion. 


Previous studies further indicated the following results which are germane to 
the present study. The language functions of Self-Maintaining, Directing, and Re- 
porting occurred to a reasonable degree over all activities, while the language func- 
tions of Towards Logical Reasoning, Predicting, Projecting, and Imagining were 
highly concentrated in only one or two activities and virtually absent in others 
(Staab, 1983). This finding concurred with the findings of Pinnell (1975) who ana- 
lyzed the language used by grade one pupils throughout the school day regardless 
of activity. It was determined also that the activity designed to elicit Towards Logi- 
cal Reasoning and the activity designed to elicit Predicting tended to elicit both 
language functions within the same activity (Staab, 1983). It was concluded that it 
is often necessary to make predictions before one actually attempts to solve a prob- 
lem (ibid.), thus Tough’s (1979, 1973) language functions of Towards Logical Rea- 
soning and Predicting were collapsed into one category, labeled Forecasting/ 
Reasoning. Tough’s original category system was further modified and some func- 
tions were renamed as a result of research (Staab, 1983; Shafer, Staab, & Smith, 
1983). The resulting category system with language examples appears in Table 1. 
Self-Maintaining was modified and renamed Asserting and Maintaining Social 
Needs; Directing was modified and renamed Controlling; and Reporting was modi- 
fied and renamed Informing. The category of Imagining was deleted because it in- 
cluded any of the other language functions which were used in a make-believe 
context and that distinction was not necessary in the present study. The category 
system in Table 1 was used in the present study. 

The general focus of the present study was to concentrate on the language func- 
tion of Forecasting/Reasoning. Any utterance which could be categorized as one of 
the eight subfunctions appearing under the heading of Forecasting/Reasoning in 
Table 1 was termed a Forecasting/Reasoning statement. The unit of analysis and 
specific classification criteria are explained in the Methods section. Forecasting/ 
Reasoning was chosen as a focus because this function is emphasized within the 
elementary school curriculum and because the elicitation of this function requires 
highly specific contexts (Staab, 1983; Pinnell, 1975). Forecasting/Reasoning was ex- 
amined within the elementary school setting and three specific objectives were in- 
cluded. These objectives were as follows: 


Developmental Differences 

Previous studies were conducted using kindergarten and grade one children as 
subjects (Staab, 1983; Pinnell, 1975). It was found that in an activity specifically 
designed to elicit Reasoning, kindergarten children used 9.25% of the total lan- 
guage elicited for the desired function (Staab, 1983). Although significantly more 
language for the function of Reasoning was used in this activity than in any of the 
other activities, the percentage is low when compared with the proportion of lan- 
guage used for the other functions listed in Table 1 within their designed activities 
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Table 1 


Five Functions of Language with Subfunctions and Examples 


Function 1: Asserting and Maintaining Social Needs 


Subfunction 


Li 


Asserting personal rights and/or 


needs: 


Asserting negative expressions: criti- 
cizing, arguing, threatening, and giv- 


ing negative opinions: 


Asserting positive expressions: 


Requesting an opinion: 


Incidental expressions: 


Function 2: Controlling 


Subfunction 


he 
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Controlling actions of self and 


others: 


Requesting directions: 


Requesting another’s attention: 


Examples 

a. I want some juice. 

b. Give it to me, it’s mine. 

c. Im first ’cuz I’m the oldest. 

d. Ihit him because he hit me first. 
e. Ineeda blue crayon. 


You're talking too much. 


Stop it or I’ll tell. 
That looks dumb. 


a. Yes, I think so too. 
b. I like your building. 


a 
b. Itold you to quit it. 
c 
d 


c. It tastes good to me. 
a. Do you like this? 


a. Oh, gee. 
b. Good grief. 


Examples 

a. Turn it around (to self). 

b. Get one egg. 

c. Give me the blue one. 

d. Put your name at the top and Ill 
give it to the teacher. 

a. Howdo you do this? 

b. Where shall I put this? 

a. Watch this. 

b. Look here. 


Function 3: Informing 
Subfunction 


1. Commenting on past or present 
events: labeling, noting details, not- 
ing specific incidents, and/or noting 
sequence (includes statements made 
in both first and third person: 


2. Comparing: 
3. Making generalizations based on 
specific events and details: 


4. Requesting information: 


Function 4: Forecasting and Reasoning 


Subfunction 


1. Explaining a process: 


2. Recognizing causal and dependent 
relationships: 


3. Recognizing problems and _ their 


solutions: 
4. Justifying judgments and actions: 


5. Drawing conclusions: 


6. Recognizing principles: 


7. Requesting a reason: 


8.  Anticipating/forecasting (in terms of 
anticipating problems, possible 
solutions, alternative courses of 
action when problem is spotted, and 
predicting consequences): 


Language Function of Forecasting/Reasoning 


Examples 

a. That’s.a car. 

b. It’s blue and white. 

c. I (the boy) have (has) some paint. 

d. Iput the red on before the yellow. 

a. The first bus is longer than the sec- 
ond. 

a. My brother is sick today. 

b. The cars were in an accident. 

a. What color is this? 

b. Is this one longer? 

Examples 

a. When you break your arm they put 
something on it like a bandage and 
plaster. 

a. If the bridge is low, boats can’t go 
under and it’ll have to open in the 
middle to let them out. 

a. This box isn’t big enough to make a 
garage to so I’m going to make it 
with blocks. 

a. I’m cutting this box to make doors 
’cos we use doors to put cars in. 

a. I don’t think we better take the 
clock to pieces because we’d never 
get it together again. 

a. Iwanta bike, but I can’t have one 
because it’s too dangerous. 

a. Why does this work? 

a. That brick won’t be long enough, 
I’ll get some wood. 

b. We could make a bridge or we 
could get a long ladder. 

c. That propellor will fall off, if you 


put it on that way. 
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Function 5: Projecting 


Subfunction Examples 

1. Projecting into the feelings and reac- a. He’s feeling sad. 
tions of others: b. He’s probably mad about it. 

2. Projecting into the experiences of ~ a. I wouldn’t like to live in the zoo like 
others: tigers. 


b. That boy’s parents are fighting and 
he is caught in the middle. 


(ibid.). Given the Piagetian notion (Piaget & Inhelder, 1969; Piaget, 1926) of a 
child’s cognitive development, one must speculate as to whether or not the liberal 
use of language for Reasoning is a matter of cognitive development. The present 
study explored the language of children at roughly the ages Piaget would term 
Preoperational, Concrete Operational, and at the level of Formal Operations in an 
effort to determine differences in the use of language for Forecasting/Reasoning. 


Influence of Modeling 


The influence of modeling was determined by including a language model in a 
small group activity. Within the context of the present study, a language model was 
defined as a person using language liberally for the function of Forecasting/Reason- 
ing. Wells and Wells (1984) have stated that children learn most effectively when 
there is an opportunity for collaborative talk with the teacher and peers, and 
Cazden (1983) has noted that language models are valuable in helping children 
learn particular discourse forms needed within the school setting. This study at- 
tempted a more scientific investigation of these theories. Specific information on 
the language modeling technique is presented in the Methods section. 


Influence of a Questioning Strategy 


Athey (1983) has proposed that some discussion time in classrooms might be 
spent more productively with the teacher deliberately posing problems to help stu- 
dents sharpen their concepts. The present study explored the possibility that a 
questioning strategy designed to elicit Forecasting/Reasoning would increase 
students’ performance in the use of this language function within an appropriate 
activity (i.e., an activity specifically designed to elicit Forecasting/Reasoning). The 
questioning strategy is discussed in the Methods section. 

A desired exploration of age differences in the use of Forecasting/Reasoning, 
and the effects of modeling and of a questioning strategy led to the formation of 
three specific hypotheses. 


Specific Hypotheses 


Children in kindergarten, grade three, and grade six worked in groups of three 
with others of comparable ages on activities appropriate to their grade level and 
specifically designed to elicit Forecasting/Reasoning (i.e., the activities contained 
the essence of problem solving). During one treatment, hereafter termed “Activity,” 
the three children engaged in a problem-solving activity by themselves. During a 
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second treatment, hereafter termed “Modeling,” the researcher joined the group of 
three children as a language model within the course of the activity. During the 
third treatment, hereafter termed “Questioning,” the researcher joined the three 
children during the activity and engaged in a specific questioning strategy. These 
treatments are explained in detail in the Methods section. All activities were 
videotaped and a written transcript made from the tape. The resulting language 
was segmented into separate utterances and categorized using the system described 
in Table 1. The study tested the following hypotheses: 
I The amount of language used for Forecasting/Reasoning will differ over grade 
level averaged over all treatments. 
2 The amount of language used for Forecasting/Reasoning will differ over 
treatments averaged over all grade levels. 


3 The amount of language used for Forecasting/Reasoning will differ across grade 
‘level across treatment. 


Method 
Subjects 


Subjects were 90 kindergartners, 90 students from grade three, and 90 students 
from grade six. The subjects were from three elementary schools located within a 
suburban area of Vancouver, British Columbia. The three schools were chosen by 
the superintendent of the school district because of their similarity in socioeco- 
nomic factors, each school drawing students primarily from suburban, middle class 
neighborhoods. Thirty students from each grade level were taken from each school. 

Classroom teachers were asked to choose subjects on the basis of three criteria: 
(a) the subject was fluent in English, (b) he/she seemed to be an adequate user of 
language in many situations, and (c) he/she appeared to use language occasionally 
for Forecasting/Reasoning. Karlin (1980) has stated that teachers’ opinions of stu- 
dents’ abilities are as accurate as are the results of standardized tests and perhaps 
more accurate. The data were collected near the end of the school year so it was as- 
sumed that teachers were able to choose subjects according to the criteria. 


Design 

The subjects within each school, and within each grade level (i.e., kindergarten, 
three, and six) were assigned randomly to groups of three, hereafter termed triads. 
Although randomly assigned, students were not separated from members of either 
their own grade or their own class. As there were 30 subjects per grade level per 
school, 10 triads per grade level per school were formed. Each of these 10 triads 
was then assigned randomly to one of three treatments: (a) Activity, (b) Modeling, 
or (c) Questioning. These treatments are discussed further in the Procedure sec- 
tion. As there were three treatments and 10 triads, the final triad at School One at 
each grade level was assigned randomly to one of the three treatments. The final 
triad at School Two at each grade level was then assigned randomly to one of the 
two remaining treatments, and the final triad at School Three was assigned to the 
remaining treatment. Because subjects were recruited from schools with similar 
populations, the researcher believed that “School” might be eliminated as a vari- 
able, however this notion was subject to investigation. 

The original research design was a 3 x 3 fully-crossed MANCOVA (grade level x 
treatment). Following each of the treatments the triad engaged in an art activity 
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during which the language used for Forecasting/Reasoning was designed to serve as 
a potential covariate. 


Procedure 


Ten triads per grade level per school engaged in one of three treatments: 
(a) Activity, (b) Modeling, or (c) Questioning. All treatments were videotaped and 
took place in a small conference room at the subjects’ respective schools. Prior to 
each treatment session, the researcher met with each triad in an effort to establish 
rapport and demonstrate the video equipment. The treatment sessions were 
videotaped by a trained research assistant, as the researcher was a participant in 
both Modeling and Questioning. Each triad engaged in one treatment only once 
and all treatments were completed within a single school before the researcher and 
the research assistant moved to another school. 


Subsequent to the videotaping, a written transcript was made by a second re- 
search assistant and checked for accuracy by the researcher. 


Treatments 


Treatment 1: Activity was an activity specifically designed and tested to elicit 
the language function of Forecasting/Reasoning (Shafer, Staab, & Smith, 1983). 
The essence of the activity was solving a particular problem which had been posed 
by the researcher. As previously mentioned, the essence of an activity is that 
specific element which elicits the language function and which can be generalized 
to other similar activities. The problem was presented verbally to the triad and all 
the needed materials to discover its solution were made available. The subjects 
were then allowed to work on the problem without any further interaction with the 
researcher. The activity was concluded when the subjects indicated by words or 
actions that they had finished. Three different activities were used, one for each 
grade level. As the essence of all activities was solving a specific problem, it was un- 
likely that what would present a problem to kindergartners would also present a 
problem to subjects in grade six. The fact that the three activities were “equal” in 
their ability to elicit Forecasting/Reasoning has been previously discussed. The 
specific activity used at the kindergarten level was a float/sink experiment. Kinder- 
garten triads were given a number of objects and asked to determine which would 
float in a small tub of water. If certain objects did not float, the kindergartners 
were asked to devise ways to make them float. Blocks of wood and bottles with 
caps were provided for that purpose. The activity at the grade three level was to 
determine which objects would conduct an electrical current. A circuit using a bat- 
tery and a small light bulb was devised. Students were given a number of objects 
which would conduct electricity and a number which would not conduct electricity. 
The grade six subjects were provided with a model of a city on which were placed 
movable cardboard houses, schools, parks, building, an airport, and so forth. 
Subjects were to decide how the city might be designed in terms of efficiency, con- 
venience, and transportation patterns, and then to place the movable parts accord- 
ingly. 

The following is a sample from a sixth grade transcript. G, T, and B are 
subjects. 


G. Move the city hall to the right because it should be closer to downtown. 
T. No, it’s good there. 
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B. I don’t like all the parks together. 


Treatment 2: Modeling was the same activity per grade level as used in 
Treatment 1: Activity (ie., kingergarten/float-sink, grade three/electrical circuit, 
and grade six/model town). However, during Modeling, the researcher joined the 
triad as a fourth participant. The researcher made one utterance which could be 
classified as Forecasting/Reasoning every 30 seconds. Sample Forecasting/Reason- 
ing statements appear in the example section of Table 1 under the heading 
Forecasting/Reasoning. These utterances fit the context of the conversation and 
were germane to what was happening at the specific time. 


The following is a sample from a sixth grade transcript. R is the researcher and 
M and N are subjects. 


R. I don’t think we should put the airport downtown because land is too expensive. 
M. How about in the right-hand corner? 
N. No, over by the dump. 


Treatment 3: Questioning was the same activity per grade level as Treatment 1: 
Activity (ie., kindergarten/float-sink, grade three/electrical circuit, and grade 
six/model town). During Questioning, however, the researcher joined the triad as a 
fourth participant and asked a question designed to elicit Forecasting/Reasoning 
every 30 seconds. Sample questions are contained in Table 2. 


Table 2 
Questioning Strategy to Elicit Forecasting and/or Reasoning 
Strategies Examples 
1. Questions to elicit explaining a process: a. What happens when you break your 
arm? 
a. Questions to elicit recognizing causal b. Ifthe bridge is too low, what will hap- 
and dependent relationships: pen to boats? 
b. Questions to elicit recognizing problems c. What’s the matter here? 
and their solutions: What can we do about it? 
c. Questions to elicit justifying judgments d. Why are you doing that? 
and actions: 
d. Questions to elicit drawing conclusions: e. Why do you suppose this is happening? 
e. Questions to elicit recognizing prin- f. What is the general reason why all these 
ciples: work? 


f. Questions to elicit a reason. Why do you think so? 


g. Questions to elicit anticipating/forecast- h. What will happen if... 
ing: What could we do now? 


a 
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The following is a sample from a sixth grade transcript. R is the researcher and 
P and H are subjects. 
R. Why should the housing development be close to a park? 
P. Because kids need a place to play. 
H. Yeah! 


Following each treatment, each triad at all grade levels engaged in an art activ- 
ity without the researcher involved. Each triad was given one large sheet of paper 
and felt tip pens and then asked to cooperatively draw -a group picture. The art 
activity was also videotaped, a written transcript made, and the resulting language 
coded according to the functions listed in Table 1. The number of utterances used 
tor the function of Forecasting/Reasoning was used as a potential covariate. As the 
amount of language used for the function of Forecasting/Reasoning is not prevalent 
over all school activities (Staab, 1983; Pinnell, 1975), an art activity which entailed 
some amount of planning was devised and it was believed that this art activity 
would elicit some language used for Forecasting/Reasoning. 


Unit of Analysis 
The written transcript from the videotape was divided into statements which 

were used as the unit of analysis. Statements were defined a short segments of talk 
and identified as according to the following criteria: (a) a statement is composed of 
a word, a series of words, or a sentence; (b) a statement must contain a clear 
thought or idea; (c) a statement need not be “grammatical” in the formal sense; and 
(d) immediate repetition of a word or words which express the same idea are not 
counted as separate statements. In the following transcript, used for illustration, 
statements are marked with a slash (/). 

Context: (The children were testing objects in a pan of water to determine whether or not 

the objects would float.) 


S. Oh that won’t float/(looks at the object before testing) 
J. Un, un/(shaking his head indicating “no”) 
T. Let’s see if this will float/(holds up object). It does, it does/ 


S. If you want to make this float, you’re going to have to strap it to some wood/(holds up 
nail). 


Language Categories 

Each statement was categorized according to the category system which appears 
in Table 1. The total number of statements per function per triad was calculated. 
Statements were categorized according to the full range of language functions (i.e., 
Asserting and Maintaining Social Needs, Controlling, Forecasting and Reasoning, 
and Projecting) even though the researcher was primarily concerned with 
Forecasting/Reasoning. 


Classification Criteria for Coding Language by Function 


In order that statements may be coded for function according to the system 
presented in Table 1, classification criteria needed to be established. Statements 
were coded for function using a combination of the following criteria: (a) literal 
content—subject, predicate, adverbial phrases, and other constituents, (b) intona- 
tion, facial expressions, and gestures of the speaker, (c) the nature of the activity 
and what was happening at a given time within the activity, and (d) the verbal and 
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nonverbal reactions of the speaker to the listener’s response. In assigning function, 
for example, one is attempting to discover the speaker’s purpose for speaking and 
often the actual words spoken do not reveal the intended function. If a speaker 
said, “I’m cold,” this might simply be a statement of physical condition, or it might 
be a request to close the window or turn up the heat. The above criteria in combi- 
nation were used to deterine the function. As all subjects, the researcher, and all 
research assistants were members of the same speech community, it was assumed 
that statements could be identified on the basis of these criteria. 


Interrater Reliability 


A separate research assistant from those persons who had videotaped and made 
the written transcript was employed to divide the language into statements, and a 
fourth research assistant was employed to categorize the statements by function. 
The researcher provided numerous practice transcripts and training sessions in the 
concept of a statement and the category system. Following these practice sessions, 
the research assistants independently segmented the language and categorized it by 
function. Subsequently, the researcher independently categorized a randomly se- 
lected sample of one-third of the transcripts, and the interrater reliability was 
found to be .96. Both of the research assistants were ignorant of the study’s pur- 
pose or the proposed hypotheses. 


Results 


Initial interrogation of the data considered all language functions listed in 
Table 1 even though the researcher was primarily interested in the language func- 
tion of Forecasting/Reasoning. Initially, Canonical Correlation Analysis was used to 
indicate association of variables. Canonical Correlation Analysis provides informa- 
tion on the relations between sets of variables and is often used as a first step in 
indicating the degree of association between these variables (Pedhazur, 1982). A 
Canonical Correlation Analysis was performed between the three grouping vari- 
ables (i.e., school, grade, and treatment) and the eight language variables; which in- 
cluded four language functions under two conditions, problem-solving activities, 
hereafter termed “Science,” and Art activities (i.e., Social Needs in Science (SNS), 
Controlling in Science (CS), Informing in Science (IS), Forecasting/Reasoning in 
Science (FRS), Social Needs in Art (SNA), Controlling in Art (CA), Informing in 
Art (IA, and Forecasting/Reasoning in Art (FRA)). The language function of 
Projecting was nearly nonexistent in the transcripts and therefore not subject to 
analysis. Multivariate normality assumptions were upheld by acceptable skewness 
measures; there were no missing data, multi-collinearity and singularity were ab- 
sent from the squared multiple correlations of both sets; and plots of the canonical 
variates revealed no departures from linearity. However, kurtosis values indicated 
the presence of multivariate outliers within the variables: SNS (6.44), CD (4.15), IS 
(3.01), FRS (11.36), and SNA (.76). Seven of the 90 scores had z values greater than 
plus or minus three, and these scores were adjusted through the substitution of the 
adjusted means for the extreme scores (i.e., the data were Windsorized). Canonical 
Analysis, as in other multivariate techniques, is extremely sensitive to the presence 
of outliers (Tabachnick & Fiddell, 1983). 

A subsequent Canonical Correlation Analysis with the adjusted data produced 
three statistically significant canonical correlations. The first canonical correlation 
was .82 (68% of the variance); the second was .76 (58% of the variance); and the 
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third was .48 (23% of the variance). Chi-square values were X?(24) =187.61, p<.001 
for the first correlation; X?(14)=93.59, p<.001 for the second; and X?(6)=22, 
p<.001 for the third. Analysis of the separate canonical variates is shown in 
Table 3. 


Table 3 
Correlations, Standardized Canonical Coefficients, Canonical Correlations 
Percents of Variance, and Redundancies Between School, Grade, Treatment, 
and Language Functions and Their Corresponding Canonical Variates 


First canonical Second canonical Third canonical 
variate variate variate 
Correl. Standardized Correl. Standardized Correl. Standardized 
Coeff. Coeff. Coeff. 

Grouping set 

School 227 .209 = Ay = 951 949 

Grade .820 .820 — .485 — 0.485 = 108} = 08 

Treatment By O29) 841 849 .096 .064 

Percent of 34 130 .34 Total=1.00 

Variance 

Redundancy 25 19 07 Total=.49 
Language Set 

SNS AD) 343 == 1B =—.089 ily 

CS .647 A479 S83) = ith 203 

IS .009 — 220 338 1.006 .556 

FRS 492 —620 — 469 = nef! 442 

SNA 518 154 = 156 = tld) 426 

CA 605 .268 Saito —.156 ATI 

IA satoitl piles 101 BO2 .687 

FRA 194 oli 195 2 == il 

Percent of 22) 06 all Total=.49 

Variance 

Redundancy 15 04 05 Total=.24 

Canonical .68 58 a 

Correlation 
Note: 
SNS—Social Needs: Science; CS=Controlling Science; IS= Informing: Science; 


FRS=Forecasting/Reasoning: Science; SNA=Social Needs: Art; CA=Controlling: Art; [A=Informing: Art; 
and FRA = Forecasting/Reasoning: Art. 


Interpretation of the first canonical variate, with .4 used as the cutoff value, 
showed high correlation for Grade (.82) and Treatment (.532). The relevant lan- 
guage variables of the second set of variables of the first variate were, in order of 
magnitude, CS, CA, SNS, SNA, and FRS. The second canonical variate showed 
high correlation with Treatment (.84) and Grade (-.485). Among the language vari- 
ables, only FRS had a high correlation. As expected, the third canonical variate had 
a high correlation only with School and was relevant to all language variables of the 
second set except CS and FRA. This initial Canonical Correlation Analysis 
suggested adjustments to the statistical procedures to enhance interpretability. 
Both Grade and Treatment received significant loadings on the first and second 
variate. The interpretability of grouping variables is enhanced through the use of 
Effect Coding (Pedhazur, 1982). Effect Coding was used to partition the levels of 
School, Grade, and Treatment in an effort to separate Grade and Treatment. The 
results of this second Canonical Correlation Analysis are shown in Table 4. 
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Table 4 
Correlations, Canonical Coefficients, Canonical Correlations 
Percents of Variance, and Redundancies Between School, Grade, 
Treatment, and Language Functions and Their Corresponding 
Canonical Variates (Effect Coded) 


First canonical variate Second canonical variate 
Correlation Coefficient Correlation Coefficient 

Grouping set 
School 1 a ORO Ney .898 .782 
School 2 = .160 =Ow 610 PS 
Grade: Kindergarten == (07K0) 0.0 .898 0.0 
Grade: 3 —.160 0.0 .610 0.0 
Treatment: Questioning .986 1.041 148 —.089 
Treatment: Modeling 408 = ilies A401 436 
Percent of Variance .20 42 
Redundancy ie 14 

Language Set 
SNS 167 — .090 SBD — .647 
CS 183 = URY oil 499 
IS melo —.706 = 505) acetal 
FRS .687 1.081 ley! sales! 
SNA = ileAel 041 = 621 aa. 
CA = LAd .041 = lol —A25 
IA =e = Al? (45,0) = .467 
FRA —=.208 —.190 = lM .496 
Percent .095 ‘33 Totals: .425 
of Variance 
Redundancy .06 wlll Totals: .17 
Canonical Correlation 61 aay) 

Note: 


SNS—Social Needs: Science; CS =Controlling Science; IS= Informing Science;FRS = Forecasting/Reasoning: 
SNA=Social Needs: Art; CA=Controlling: Art; [A=Informing Art; and FRA= Forecasting/Reasoning: Art. 


The first canonical correlation was .78 (61% of the variance); X?(32)=119.69, 
p<.001. The second canonical correlation was .57 (32% of the variance; X*= 41.43, 
p<.005. No other canonical correlations were significant. 

The first canonical variate showed high correlations on the treatments 
Questioning (.986) and Modeling (.408). The second canonical variate showed that 
Grade was not pivoted in the analysis, indicating an identity relation with School. 
Although statistically significant, the second variate is uninterpretable due to the 
high correlations on all of the language variables of the language set except FRS 
and FRA. It should be noted, however, that the interpretability of the relationship 
of Treatment to the language variables is an important result. 


Given that Grade and Treatment were in close association on the initial analy- 
sis, and School and Grade were associated on the Effect Coded analysis, a3 x 3 x 3 
Fully-Coded Factorial MANOVA was run to test for School x Grade; School x 
Treatment; Grade x Treatment; and School x Grade x Treatment interaction with 
the language variables. The statistical package used was BMDP4V (Dixon, 1983) 
with a factorial design for the analysis. The results indicated that there were no 
significant interaction effects for the grouping variables (i.e., School, Grade, 
Treatment). Grade was shown to be significant F(16, 112)=8.10, p<.001 and with 
significant measures on all language variables except IA and FRA. Treatment 
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effects were significant, F(16, 112)=7.10, p<.001; however among the language 
variables only IS, F(2, 63)=5.62, p<.0062, and FRS, F(2, 63)=21.86, p<.001 
showed significant treatment effects. School was not significant at the .01 level, 
thus eliminating it as a variable. This result did not conflict with the findings indi- 
cated in Table 3 because only the third canonical variate, which was 
uninterpretably due to the high correlations on all language variables except CS 
and FRA, was correlated with school. 


Initial interrogation of the data also included an item by item analysis of 
variance to determine if FRA was a covariate with FRS—the problem-solving 
activity. The mean for FRS across all grades and treatments was 22.1 in terms of 
the number of statements categorized as Forecasting/Reasoning, while the mean for 
FRA across all grades and treatments was 2.72. A histogram of both variables indi- 
cated a range of 0-70 with 31 observations above 50 for FRS, and a range of 0-12, 
with 23 observations of zero for FRA. The correlation between FRS and FRA was 
.006. It was determined that language categorized as Forecasting/Reasoning during 
the art activity could not be used as a covariate with language categorized as 
Forecasting/Reasoning during the problem-solving activity (science). Accordingly, 
the language functions for science were separated from those for art in subsequent 
analysis. 

As the study proposed three hypotheses, the following results relate more 
specifically to those hypotheses. Initial interrogation of the data using Canonical 
Correlation Analysis indicated general associations of the variables and the need 
for subsequent interrogation of the data to more specifically address the 
hypotheses. 


Hypothesis 1: That the Amount of Language Used for Forecasting/Reasoning 
Would Differ Over Grade Level Averaged Over all Treatments was Accepted 

The results of a MANOVA comparing grade level with the science variables at 
the .01 level of significance were as follows: F(8, 156)=13.76, p<.001. FRS with re- 
spect to Grade was significant: F(2, 81)=8.89, p<.001. To identify the source of 
variance between grade levels(i.e., K, 3, 6) an ANOVA of FRS with Grade was run. 
The results were as follows: F(2, 87)=5.68, p<.0048. 


Table 5 indicates the means and standard deviations for the grade levels. 


Table 5 
Means and Standard Deviations for Language Used 
During Science for Forecasting/Reasoning 


Over Three Grade Levels 
Grade Mean Standard Variation 
K PSY eo 
3 28.37 13.67 
6 16.10 14.08 


A Pairwise Multiple Comparison test indicated the following results: Kindergar- 
ten compared with Grade Three (FX1.16, p=.2841); Kindergarten compared with 
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Grade Six (F=4.97, p=.0285); and Grade Three compared with Grade Six 
(F=10.925, p<.001). Grade Three was significantly higher than Grade Six at the 
.01 level of significance. 


Hypothesis 2: That the Amount of Language Used for Forecasting/Reasoning 
Would Differ Over Treatment Averaged Over all Grade Levels was Accepted 

The results of a MANOVA comparing Treatment with FRS at the .01 level of 
significance were as follows: F(2,81)=22.72, p<.001. To identify the source of 
variance with respect to Treatment (i.e., Activity, Modeling, Questioning) and 
ANOVA of FRS with Treatment revealed the following: F(2, 87) =18.22, p<.001. 


Table 6 indicates means and standard deviations for the three treatments. 


Table 6 
Means and Standard Deviations for Language Used 
for Forecasting/Reasoning Over Three Treatments 


Treatment Mean Standard Deviation 
Activity 16.20 12.40 
Modelling 18.20 10.98 
Questioning 34.43 14.82 


A Pairwise Multiple Comparison test indicated the following results: Modeling 
compared with Activity (F=.364, p<.55); Questioning compared with Activity 
(F=30.29, p<.001); and Questioning compared with Modeling (F= 24.01, p=.001). 
Questioning was significantly different from both Modeling and Activity, but no 
significant difference was found between Modeling and Activity. It should be noted 
that Canonical Correlation Analysis, the MANOVAS and the ANOVAS were 
convergent on identifying the association between Questioning and FRS. 


Hypothesis 3: That the Amount of Language Used for Forecasting/Reasoning 
Would Differ Across Grade Level Across Treatment was Rejected 

The results of a MANOVA comparing Grade by Treatment interaction for all 
science variables at the .01 level of significance was as follows: F(16,239)=1.41, 
px.1879. FRS with respect to Grade across Treatment was as_ follows: 
F(4,81) =2.48, p<.054. There appeared to be no interaction effect. 


Discussion and Implications 


Developmental Differences in Language Used for Forecasting/Reasoning 


The researcher had suspected initially that students’ performance in using the 
language function of Forecasting/Reasoning would increase with chronological age 
and with the number of years in school, hence an increasing trend would exist be- 
tween kindergarten and grade six. Conversely, the study indicated that students in 
grade six used significantly less language for Forecasting/Reasoning (p<.001) than 
did students in grade three. Further, if the significance level had been stipulated as 
.05, the grade six students would have used significantly less language for 
Forecasting/Reasoning (p<.0285) than students in kindergarten. This finding 
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again raises the issue of competence versus performance. It seems unlikely that the 
12 and 13 year olds were unable to Forecast/Reason as well as the five and six year 
olds and the eight and nine year olds. With persons of normal intelligence the abil- 
ity to solve abstract problems increases with age (Wells, 1981; Piaget & Inhelder, 
1969); and problem solving was the essence of each activity labeled Science which 
was designed to elicit the language function of Forecasting/Reasoning. Perhaps the 
findings of this study do not indicate a problem of competence but rather a lack of 
performance. It seems unlikely, yet possible, that this lack of performance by the 
sixth grade students was caused by an inappropriate task, as the model city activ- 
ity had been pilot tested with numerous children of this age range. Further, stu- 
dents appeared to understand the task and were highly motivated to accomplish it. 
During the activity, sixth grade students used sufficient language, yet it was for 
functions other than Forecasting/Reasoning. Perhaps students of this age are more 
concerned with socializing and this language for socialization overshadows all else; 
or perhaps the very process of schooling in some way inhibits students’ perform- 
ance in Forecasting/Reasoning. The study needs to be replicated and, if similar re- 
sults are found, further research is needed to determine the reason for the six 
graders’ lack of performance. 

Besides the decline in use of language for Forecasting/Reasoning in grade six, 
an additional finding concerning grade level emerged. There was no significant dif- 
ference between Kindergarten and Grade three in the amount of language used for 
Forecasting/Reasoning. Often teachers believe it is unwise to plan problem-solving 
activities for young children (e.g., kindergarten and grade one) as teachers feel 
these students are unable to participate successfully. The findings of this study in- 
dicate otherwise, at least within the context of oral language situations. Many 
kindergartners are able to solve problems and to reason quite effectively if present- 
ed with a situation that they understand. 


Treatment Differences in Language Used for Forecasting/Reasoning 


A second major finding of the study also has direct classroom implications. The 
treatment involving a questioning strategy was significantly more effective than 
either the activity alone or a modeling technique regardless of grade level (p<001 
in both cases when Questioning was compared with Activity and Modeling). Thus 
for both young children and older elementary children the most successful tech- 
nique for eliciting language for Forecasting/Reasoning may be to focus the stu- 
dents’ attention by specific questions within the context of an appropriate 
situation. Questioning strategies need not be a set format prepared in advance of a 
lesson, but may be used as a method for focusing attention on a given aspect of a 
task. Teachers need to internalize the strategy explained in Table 2 so that when 
students are engaged in a problem-solving activity, teachers can circulate and nat- 
urally engage students in talk which is relevant to the problem’s solution. 


The fact that the data was gathered during a single 50 to 60 minute session 
with each triad may have had some bearing on the results of treatment, particu- 
larly with respect to Modeling. The researcher must consider whether or not the 
amount of language used for Forecasting/Reasoning would have increased if model- 
ing were carried out consistently over a period of time. Cazden (1983) has noted 
that models are valuable in helping children learn particular discourse forms need- 
ed in the school setting, but she does not discuss the factor of time needed to effect 
change. 
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Additional Findings 


The fact that the language function of Projecting could not be used in initial 
Canonical Correlation Analysis because language for this function was virtually 
nonexistent, further strengthens the finding that except for the language functions 
of Social Needs, Controlling, and Informing which are present to some degree over 
all classroom activities, language function is highly sensitive to context (Staab, 
1983). This notion was confirmed further by the fact that the art activity could not 
be used as a covariate; suggesting that the language function of Forecasting/Rea- 
soning does not occur over a variety of situations but is elicited by carefully struc- 
tured activities. Activities which elicit Forecasting/Reasoning contain the common 
essence of problem solving. Teachers need to be aware of this essence and to en- 
courage children to problem solve in small groups when appropriate. Teachers can- 
not trust that language for Forecasting/Reasoning will occur naturally in the 
classroom but need to structure problem-solving activities within the context of the 
school curriculum. Teachers also need to internalize the questioning strategy em- 
bodied in Table 2 and be prepared to use it in informal situations as students are 
engaging in these problem-solving activities. 

As students are encouraged to solve problems and as this activity becomes more 
of a reality in elementary classrooms, it is expected that children’s performance in 
the use of language for Forecasting/Reasoning will be augmented. 
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Conservers and Nonconservers: 
How Do Their Composition 
and Story Retelling Abilities Compare? 


The purpose of this research was to examine various language relationships among 
grade two children at two levels of cognitive development (conservers and 
nonconservers). The subjects were selected from four classrooms in one suburban 
Winnipeg school and were pretested with the Goldschmid-Bentler Concept Assess- 
ment Kit—Conservation and then classified as nonconserver, transitional conserver 
or conserver. Each student was given several opportunities to tell a story, to retell a 
story, to dictate a story, and to write a story independently. These productions were 
analyzed with a variety of linguistic fluency and rhetorical quality measures. 
Measurable differences were found between conservers’ and nonconservers’ abilities 
in the four tasks. 


Many children in grade 2 are preoperational (nonconservers) or concrete 
operational (conservers) in their level of thought (Piaget & Inhelder, 1969). The 
majority of children progress from one level to the next during the primary grade 
years. Since each level represents distinct thinking capabilities (Piaget, 1969; 
Piaget & Inhelder, 1969; Goldschmid, 1967), it would be useful for educators to 
know that such cognitive development is identifiable in the language fluency and 
rhetorical quality of the stories children of this age level tell, retell, dictate, and 
write. 

The particular significance of the present study lay in the fact that it addressed 
the controversy (Stein & Glenn, 1978; Brown, 1975; Piaget & Inhelder, 1969) that 
memory rather than comprehension accounts for differences in recall protocols. In 
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this study children were stratified through prior testing of their cognitive levels. 
Research which generated the controversy (Stein & Glenn, 1978; Brown, 1975) had 
studied children of a preoperational age (kindergarten and grade 1), but had not 
actually determined their level of cognitive functioning. In addition to the evalua- 
tion of the children’s cognitive level, the cognitive dependence of the 
preoperational child upon observable stimuli was accommodated through use of se- 
quentially ordered pictures in the story generation task. The pictures were absent 
in the second task, story retelling. In other words, the story generation task ad- 
dressed the issue of the preoperational child’s ability to comprehend sequential 
order in stories and the retelling task measured the preoperational child’s ability to 
remember stories in sequential order. 


Also of significance was the investigation of the effect cognitive levels of 
functioning had on four modes of story-language. Story-language modes referred to 
the tasks of story generation, retelling, dictation, and independent writing. These 
tasks are described in more detail in a later section of this article. Godwin (1963) 
and King (1978) indicated a need to focus research on the four facets of language— 
speaking, listening, writing, and reading. Through the four story-language modes, it 
was possible to encompass these four language arts. Hence this study represented 
an expansion from comparison of two story-language modes (Brown, 1975) and 
three story-language modes (King & Rentel, 1981) to comparison of four story-lan- 
guage modes. 


Its primary purpose was to address the notion that the cognitive level 
(preoperational or concrete operational) of young children influences their ability 
to recall the logical sequence found in stories (Piaget & Inhelder, 1969). The effect 
cognitive level exerted on a child’s story-language behavior was analyzed through 
the child’s ability to generate, retell, dictate, and write stories. 

A secondary purpose was to examine the quantitative and qualitative aspects of 
the language generated in the four modes. This purpose was accomplished by 
measuring the microstructure, or language output, through use of the T-unit and 
dependent clause (Froese, 1984; Loban, 1976; Hunt, 1965), and the macrostructure 
through the use of Rumelhart’s (1975) story grammar (i.e., setting, initiating event, 
reaction, action, and consequence). 


Method 
Research Questions 


Two specific questions guided the research: 


A. Are children classified as conservers able to provide more language output in the 
four modes (story generation, retelling, dictation, and writing) than 
nonconservers? 

ae Are children classified as conservers able to provide a greater incidence of 


specific macrostructures in the four modes (story generation, retelling, dictation, 
and writing) then nonconservers? 


Subjects 

The subjects in this study were 35 grade 2 children from four individual class- 
rooms in one suburban Winnipeg school. Through prior testing of a sampling of 51 
of the 92 grade 2 children with the Goldschmid-Bentler Concept Assessment Kit— 
Conservation—Form A, it was possible to classifv them as 11 nonconservers (Score 
0-6), 12 transitional conservers (Score 7-10), and 12 conservers (Score 11-12). 
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Procedure 


Kach of the conservers and nonconservers provided a total of twelve story-lan- 
guage samples while the transitional conservers provided six story-language 
samples. The transitional conservers were not specifically the object of this study; 
therefore, smaller samples were taken for possible explanatory purposes. The four 
modes of language: story generation, retelling, dictation, and writing were collected 
weekly for three consecutive weeks. The story generation task required each child 
to tell a story based upon a logically sequenced set of 15 black and white pictures 
from one of three textless picture books by Fernando Krahn (1981, 1977, 1974). 
The tape-recorded story was later transcribed. The story retelling task was the 
child’s recollection of the story composed with the aid of the picture sequence. This 
story was retold without the aid of pictures to a second adult who had not been 
present at the child’s first storytelling. The story retelling was again tape-recorded 
for later transcription. The story dictation task was the child’s opportunity to com- 
pose a story as the researcher transcribed it, while the writing task enabled the 
child to compose and transcribe a story. As a result, two stories were produced in 
an oral context while two other stories were developed under the constraints of 
transcription. Moreover, there were four stories collected per week from each child 
classified as a conserver or nonconserver under four different circumstances, and 
two stories per week were gathered from children classified as transitional 
conservers through story dictation and writing. 

Each of the stories was analyzed according to seven measures of linguistic 
quantity and complexity of syntax (also referred to as language output): Total 
Number of T-units, Total Number of Words, Mean T-unit Length, Total Number 
of Mazes, Total Number of Words in Mazes, Total Number of Dependent Clauses, 
and Mean Dependent Clauses per T-unit (Loban, 1976; Hunt, 1965); and five rhe- 
torical quality measures of story macrostructure: Setting, Initiating Event, Reac- 
tion, Action, and Consequence (Rumelhart, 1975). 


Results 
Research Question 1 

Children who were classified as conservers were found to use a greater Total 
Number of Words and Total Number of Dependent Clauses than did children clas- 
sified as nonconservers in the four story-language modes combined. 

The results of the two-way analysis of variance (B.M.D.P. 2V, Dixon, 1982) re- 
vealed that there were statistically significant differences (p<.01) between 
conservers and nonconservers with regard to two measures of language output, the 
Total Number of Words (F[1,21]=10.05, p<.005) and the Total Number of 
Dependent Clauses (F[1,21]=7.61, p<.012) in the four modes (story generation, 
retelling, dictation, and writing) combined. Table 1 presents examples of the com- 
parative mean T-unit lengths for conservers and nonconservers. 

King and Rentel (1981) delineated three steps in the young child’s journey from 
oral to written language with the first being the ability to sustain an utterance 
unsupported by prompting, questions, and feedback. This step related well to the 
Total Number of Words used. Moreover, if children who were classified as 
conservers were able to express utterances of significantly greater length in 


129 


Teresa Hay and Victor Froese 


Table 1 
Mean T-unit Length of Conservers and Nonconservers 
in Four Language Modes 


Mode Conserver Nonconserver 
1. Dictation 9.527 8.817 
2. Independent writing 8.360 7.023 
3. Telling 8.180 VAlS 
4. Retelling 8.253 7.180 


comparison to those used by nonconservers in the four modes of story generation, 
retelling, dictation, and writing, then it would seem that this verbal ability was of 
assistance to the conserver in all four story language tasks and exemplified the first 
stage referred to by King and Rentel (1979). 


The statistically significant difference between conservers and nonconservers in 
the Total Number of Dependent Clauses used in the four modes of story genera- 
tion, retelling, dictation, and writing combined may be considered an indirect 
indication of increased linguistic maturity. Hunt (1965) concluded that the mean 
T-unit length was an index of linguistic maturity which grew in length due to an 
increase in subordinate clauses. It is noted that dependent clauses (subordinate 
clauses) were produced with a statistically significant difference by conservers 
when compared to nonconservers, a situation which may indicate that conservers 
were beginning to be able to consolidate more thought into longer units. Hunt 
(1965) claimed that, “Unless we suppose that there is less thought per word in the 
writing of older students then we must suppose that as students mature they learn 
to incorporate a larger and larger body of thought into a single related organiza- 
tion.” 


The reason that it is suggested that conservers were beginning to consolidate 
more thought into longer units is that there should also have been a statistically 
significant difference between the two developmental groups with regard to the 
measure of the Mean T-unit Length but this was not the case in this study. 
Nevertheless, all means favored the conservers. 


Research Question 2 


The children who were classified as conservers used the story macrostructure 
categories of Initiating Event and Reaction more often in the four story-language 
modes combined (story generation, retelling, dictation, and writing) as compared to 
children classified as nonconservers. Furthermore, conservers used the Initiating 
Event macrostructure category more frequently in the recall task (story retelling) 
as compared with either the story dictation or writing tasks. Nonconservers, on the 
other hand, showed no difference between their use of any of the story 
macrostructure categories in their story retellings as compared with the stories they 
dictated or wrote. 


A two-way analysis of variance (B.M.D.P. 2V, Dixon, 1982) indicated that two 
macrostructure categories, Initiating Event (F[1,21]=7.82, p<.000) and Reaction 
(F[1,21]-8.80, p<.007) were statistically significant for conservers as compared to 
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nonconservers in the four modes combined. Table 2 presents the means for the two 
significant categories. Comparing these findings with those of Stein and Glenn 
(1978), it is noted that in the Stein and Glenn study the Initiating Event 
macrostructure category was the third most readily recalled category, while the Re- 
action macrostructure category, which in the present study was more like Stein and 
Glenn’s Internal Response category, was the fourth most likely to be remembered 
category, and one which fifth graders recalled significantly (p<.05) more often 
than first graders. Since, in the present study, conservers tended to make signifi- 
cant use of Initiating Event and Recreation macrostructure categories in all four 
modes as compared with nonconservers, conservers’ usage of the two 
macrostructures may be indicative of a more advanced level of functioning more 
comparable to that of grade 5 children than to the grade 1 children studied by 
Stein and Glenn (1978). 


way analysis of variance, indicated that there was a statistically significant effect 
due to Developmental level (F[1,21]=7.92, p<.000) and that there was also a 
statistically significant interaction between Developmental level and Mode 
(F[3,63]=6.94, p<.005), the interaction was probed by means of multiple t-tests 
(a=.01). The result of these analyses indicated that, for conservers, the story gen- 
eration and retelling tasks had a statistically significant incidence of the Initiating 
Event macrostructure as compared to the tasks of story dictation and writing 
while, for nonconservers, the incidence of the initiating Event macrostructure was 
statistically significant in the story generation task as compared to the tasks of 
story dictation and writing. In other words, with the use of pictures (story genera- 
tion) both development levels utilized the Initiating Event macrostructure, but 
when the pictures were no longer present (story retelling) the conservers alone 
demonstrated a use of the Initiating Event macrostructure. 


Table 2 
Means and Standard Deviations for the Initiating Event 
Category and the Reaction Category 


CONSERVER NONCONSERVER 
Mode Initiating Reaction Initiating Reaction 
Events Events 
x S.D. xX S.D. xX cl) X S.D. 
1. Dictation 5.00 1.76 3.08 2.07 4.91 3.18 2.82 2.56 
2. Independent otal 1.44 4.08 2.78 3:13 2.53 213 2.05 
writing 
3. Telling 13.25 5.14 3.83 MS 8.73 OEE | 1.64 1.12 
4. Retelling 13.83 7,15 3.33 3.31 6.45 Bene 1.00 1.10 


The previously mentioned finding seems to support the suggestion by Piaget 
and Inhelder (1969) that preoperational thinkers (nonconservers) are stimulus 
bound; that is, when the visual stimuli were present in the story generation task, 
the nonconservers were able to identify and utilize the Initiating Event 
macrostructure category, but when the stimuli were no longer visible, as in the 
story retelling, the nonconservers were unable to utilize this macrostructure cate- 
gory. On the other hand, the concrete operational thinkers (conservers) were able 
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to produce the Initiating Event macrostructure when the stimulus was present in 
story generation and when it was absent in story retelling. Brown (1975) observed 
that recognition of a story through reviewing pictures was an easier task for 
kindergarten children who, it would be presumed, were preoperational thinkers 
than free recall in the absence of pictures, while grade 2 children were able to recall 
call from memory equally as well as they were able to reconstruct or recognize a 
story through use of pictures. Furthermore, in the present study it should be noted 
that use of visual stimuli made a significant difference in the incidence of the 
Initiating Event macrostructure for both developmental levels. The two story tasks 
(dictation and writing) which did not use visual stimuli were shown to have a 
statistically less significant incidence of the Initiating Event macrostructure when 
compared to the task (story generation) which used picture stimuli. This observa- 
tion seems to stress the value of using visual stimuli to develop young children’s 
recognition and use of narrative structure as has been observed by Poulsen, 
Kintsch, Kintsch, and Premack (1979), Brown (1975), and Smith and Bean (1983). 


Conclusion and Implications 


It was found that there was a measurable difference between conservers and 
nonconservers in the stories they generated, retold, dictated, and wrote. 
Conservers, with regard to linguistic quantity measures, used a greater Total 
Number of Words and Total Number of Dependent Clauses than did 
nonconservers in the four story-language modes combined. With regard to 
rhetorical quality measures, conservers used the macrostructure categories of 
Initiating Event and Reaction more often at a statistically significant level as com- 
pared with nonconservers. It was also found that the use of visual stimuli aided 
nonconservers to produce the Initiating Event macrostructure category in story 
generation, but when the visual stimuli were absent in story retelling, the 
nonconservers did not remember the Initiating Event macrostructure category. 
Conservers, on the other hand, used the Initiating Event macrostructure category 
at a statistically significant level in story generation and retelling. 

This study suggests that the story macrostructure categories provide a frame- 
work with which to analyze the children’s stories to determine whether all the nec- 
essary parts of the narrative are present and to what degree each part is present. 
Such analysis indicates the child’s internalized level of story comprehension and 
provides clues as to which structures may require classroom instruction. 


The T-unit appears to be a useful measure for quantifying the microstructure 
of narrative discourse. This measure gives the teacher some insight into the 
structural, linguistic, and rhetorical complexity of the child’s language by which to 
chart individual or group process. It also allows comparisons to be made across 
modes; that is, across the categories of telling, retelling, dictating, and writing. 


Most revealing, however, is the cognitive level of the student. Nonconserving 
second graders in this study performed at levels somewhere between those of first- 
graders (Froese, 1984) and the conservers in this study in their language activities. 
Whether a child is at a concrete operational level or a preoperational level is an im- 
portant variable affecting the child’s language development. Therefore, teachers of 
language arts would be well advised to be sensitive to such differences and consider 
them in their teaching methodology. Concrete mnemonic devices such as pictures 
to aid the writing process may be even more important than was previously real- 
ized. 
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Understanding of Selected 
Pre-Number Concepts: 
Relationships to a Formal Music Program 


Children at prekindergarten and kindergarten levels who were enrolled in a formal 
music program based in part on the philosophies of Zoltan Kodaly were individually 
administered a test containing items relative to the prenumber concepts of compari- 
son, order, classification, and pattern. The test was also given to a control group of 
children. Females in the experimental group who were at the kindergarten level 
attained significantly higher scores than experimental-group males and control- 
group females. 


One of the areas of learning in which some parents have made arrangements for 
their preschool age children to take formal lessons is the field of music. In a num- 
ber of situations, such lessons start as early as the age of three years. In addition to 
the development of concepts in music, it would seem that these children are being 
introduced to concepts which are the foundation for other subject areas such as 
mathematics and reading. 


It is generally agreed that a child’s understanding of number concepts such as 
equivalence, cardinality, number order, and ordinality, are enhanced by previous 
experiences with activities based on the prenumber concepts of comparison, order, 
classification, and pattern. The number concept of comparison is based on an un- 
derstanding that two numbers are either equal or one is greater than the other. The 
prerequisite prenumber concept of comparison can be taught by having children 
compare lengths and thicknesses of wooden dowels. The number concept of order is 
based on an understanding that two or more numbers can be sequenced so that 
each successive number is one greater than the previous number. Experiences with 
sets of wooden dowels of varying lengths and thicknesses are helpful in developing 
a child’s understanding of the prenumber concept of order. The concept of number 
classification is based on the idea that one numeral can represent a variety of phys- 
ical representations. For example, sets of three large marbles, three small boxes and 
a set of one pencil, one eraser and one ruler can all be represented by the numeral 
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“3.” Each of these sets may be classified on the basis of “threeness.” The prerequi- 
site prenumber concept of classification can be taught by having children sort geo- 
metrical figures on the basis of color, shape, or size. Finally, the concept of number 
patterns, exemplified by counting by twos, fives, tens, and so forth , is more easily 
understood if children have prenumber patterning experiences such as adding to an 
established linear sequence of geometrical figures. 


Observation of children who were enrolled in the Alberta College Early Child- 
hood Music Program, which was founded in Edmonton in 1973 by Carmen Bourret 
and based, in part, on the philosophies of Zoltan Kodaly, indicated that many of 
their musical experiences were closely related to three of these four prenumber 
concepts, comparison, order, and pattern. Comparison of sounds is a common activ- 
ity. Is the sound made by a drum or a shaker? Is the shaker made of metal or 
glass? Are there grains of rice or pebbles inside? Children may be asked to compare 
loud and soft sounds, a fast or slow beat, a short or long sound. Examples of tonal 
order are found in Twinkle, Twinkle Little Star (sol-fa-mi-re-do) and in Barley, 
Barley (la-sol-fa-mi-re-do). Pattern activities are so numerous that one can con- 
sider several categories. The songs Rain Rain Go Away, One Two Tie My Shoe, and 
See Saw Up And Down all have the rhythm pattern |||. Bobby Shaftoe, Twinkle, 
and Old MacDonald have the pattern (] (J (|. Meter and emphasis patterns can 
be 3/4 (strong, weak, weak) as in Lavender’s Blue, or 4/4 (strong, weak, medium, 
weak) as the Hot Cross Buns. The tone pattern sol-mi is the basis of Rain Rain. 
Hot Cross Buns is an example of the tone pattern mi-re-do. Since the musical 
activities described provide experiences related to the prenumber concepts of com- 
parison, order, and pattern, it is hypothesized that children who are enrolled in an 
early childhood music program will have a better understanding of prenumber con- 
cepts than children who don’t have these experiences. For the purposes of this 
study, the experimental subjects are children enrolled in the Alberta College Early 
Childhood Music Program. 


Related Research 


Kokas (1969) conducted tests of observation and movement with children aged 
from 3 to 6 years. The experimental music groups made fewer errors than the re- 
lated control groups when they were asked to match animal pictures and names 
and to duplicate patterns of familiar figures. Experimental groups had higher 
scores than control groups in imitating movement, recognizing rhythms of move- 
ment, and making stick designs to imitate movement. None of the differences was 
significant. Hurwitz, Wolff, Bortnick, and Kokas (1975) compared the perform- 
ances of two matched groups of grade one children on tasks of sequencing and 
spatial abilities. Results indicated that the experimental music group had signifi- 
cantly higher scores than the control group on tests of tapping sequences, duplicat- 
ing a sequence of pictures, or colors, and on the spatial tests in the Raven 
Progressive Matrices and Graham Kendall Memory for Designs batteries. Sex dif- 
ferences were also reported in which experimental-group males demonstrated a sig- 
nificantly better level of performance than control-group males. The experimental 
groups in both of the Kokas and Hurwitz studies were drawn from Kodaly-based 
music programs. 

Bain (1978) reported that significant differences favoring experimental music 
group kindergarten age children over control-group children indicated that “early 
music training serves to increase sensitivity to a child’s inner dynamics, to create 
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more flexible means of using the body-as-a-tool to achieve cognitive goals.” He also 
found no differences between experimental and control groups on tests of conserva- 
tion of space, number, substance, continuous and discontinuous quantity, and 
weight. The experimental group in the Bain study was drawn from a modified 
Suzuki-Kodaly music program. 


Pilot Study 


Prior to embarking on the present study, a pilot study (Neufeld, 1984) was con- 
ducted with an experimental group of children who were studying violin with the 
Society for Talent Education, a program based on the philosophies of Shinichi 
Suzuki, and a control group of children who were attending a day care centre. At 
the prekindergarten level, the difference in scores between the two groups on a set 
of items based on prenumber concepts was minimal. At the kindergarten level, the 
difference in scores between the two groups was considerable, favoring the children 
with formal musical experience. There was no consistent pattern of sex differences. 
Although the experimental groups in the pilot study and the regular study were 
drawn from programs based on different philosophies of teaching music, it was felt 
that both programs emphasized concepts which were related to the learning of 
mathematics. 


Administration of the Instrument 

During the months of February, March, April, and May 1984, a test developed 
by the author was individually administered to 130 children. All 30 items on the 
test were based on three-dimensional materials. The test took approximately 20 
minutes to administer. 

The test items were based on four concepts which are generally considered as 
prerequisite to grade 1 number concepts. The instrument consisted of two categor- 
ies for the order concept (length and thickness) and two classification categories 
(one-difference and four-differences). 

1. Comparison (8 items) 
a. Which of these two sticks is longer? 
b. Which of these two sticks is thicker? 
2. Order (6 items) 
a. Which of these three sticks goes in between? 
(2 greater and 2 lesser in order) 
b. Which of these three sticks goes last? 
(4 greater in order) 
c. Line up these five sticks from longest to shortest; thickest to thinnest. 
3. Classify 

a. one-difference (6 items) (shape, size, thickness) 

Which one (of these four) does not belong? (eg., 2 squares/ 
2 circles, 2 large/2 small, 1 thick/3 thin) 

b. four-differences (4 items) (shape, size, thickness, color) 
Which one (of these four) does not belong? (Note: Each one 
doesn’t belong for a different reason.) 

4. Pattern (6 items) (shape, size, thickness) 
Which one (of these three) is the missing one? 
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(eg., a sequence of 4 blocks repeated with one 
block missing) 


Results 


The results summarized in Table 1 are based on 40 children in each of the 
experimental (E) and control (C) groups. The scores of ten children of each sex 
from each of the prekindergarten and kindergarten settings were analyzed. The ex- 
perimental kindergarten group had an average of two years of formal musical expe- 
riences. The experimental prekindergarten had an average of one year of formal 
musical experiences. None of the control-group children was enrolled in a formal 
music program. 


Table 1 
Percentage Scores by Subgroup and Test Category 


Prekindergarten Kindergarten 
Male Female Male Female 
E C E 1c E C E C 
Order 70 63 70 73 83 83 1D 77 
(length) 
Order 47 ‘ay 67 83 13 90 93 90 
(thickness) 
Classify 43 47 ou 49 60 5 65 53 
(1-diff) 
Classify 45 53 45 bo 65 70 93 63 
(4-diff) 
Pattern 60 43 a0 50 40 48 53 40 
Average 52 50 46 56 60 64 2 60 


Initially a total of 112 children were tested. From this group, particular children 
were selected on the basis of similarity of sex and age at the time of testing. The 
scores for each of the categories of the test are reported as average percentages of 
the scores of the ten children in a particular subgroup. The bottom row of the table 
indicates the average percentage based on 22 test items. The scores on the eight 
items covering the comparison concept were not included in the analysis because 
all children answered all items correctly. 

Table 2 shows the ages for subgroups and total groups. The mean age for each 
of the control subgroups is approximately two months greater than for the respec- 
tive experimental subgroups. This relates to the fact that the mean test date for 
the control subjects was approximately two months later than for the experimental 
subjects. 

Treatment differences were analyzed using a one-way analysis of variance pro- 
cedure. For the prekindergarten males, there were no significant differences be- 
tween the control and experimental groups. For the prekindergarten females the 
control group scored higher than the experimental group on all five categories al- 
though none of the differences was significant. 
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Table 2 
Age in Months by Subgroups at Time of Testing 


Prekindergarten Kindergarten 
Male Female Male Female 
E C E C E C E C 
53.0 55.7 55.0 VIA 68.0 70.6 67.0 69.6 


For the kindergarten males, there were no significant differences between the 
control and experimental groups. For the kindergarten females, the experimental 
group scored significantly (p<.01) higher than the control group on the concept of 
one-difference classification. In addition, the experimental group scored signifi- 
cantly’ higher than the control group when the total scores for all five categories 
were analyzed. 


Sex differences were analyzed using a one-way analysis of variance procedure. 
In the prekindergarten control group, females scored significantly higher than 
males on the concept of order by thickness. Females scored higher than males on 
all of the other five categories although none of the differences was significant. In 
the kindergarten control group, males scored higher than females on four categories 
although none of the differences was significant. Males and females had the same 
mean score on one category. 


In the prekindergarten experimental group, males scored significantly higher 
than females on the concept of pattern. On the other four categories males and fe- 
males scored higher, but not significantly, on one category each, and had identical 
means scores on two categories. In the kindergarten experimental group, females 
scored significantly higher than males on the concept of four-difference classifica- 
tion. In three of the other four categories, females scored higher than males al- 
though differences were not significant. In addition, and unlike any of the other 
three groups, females scored significantly higher than males when the total scores 
of all five categories were analyzed. 


Summary 


At the prekindergarten level, the differences between the experimental and con- 
trol groups were minimal. There was no consistent pattern of sex differences at this 
level although the mean scores of control group females were slightly higher than 
the mean scores of control group males in all categories. 


At the kindergarten level, the differences between the scores of males in the 
control and experimental groups were minimal. For females, the scores of the ex- 
perimental group were higher than the scores of the control group in four of the 
five categories. For the concept of four-difference classification and for the total 
score over all categories, the differences were significant. 


There was a consistent pattern of sex differences at this level. For the control 
group, the mean scores of males were higher than the mean scores of females in all 
categories except one, although none of the differences was significant. For the 
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experimental group, the mean scores of females were higher than the mean scores 
of males in all categories except one. Differences favoring females were significantly 
higher for the concept of four-difference classification and for the total score. 


Conclusion 


It is concluded that females who have been in a music program for one year, 
followed by a combined year of music and kindergarten, have a better understand- 
ing of the total set of prenumber concepts than males with similar music and 
kindergarten experiences. It is also concluded that these females have a better un- 
derstanding of the total set of prenumber concepts than females with similar 
kindergarten experiences but no formal music experience. 


At the prekindergarten level it is concluded that there is no difference in the 
understanding of the total set of prenumber concepts between children who have 
been in a music program and those who have not. On the basis of this similarity be- 
tween experimental and control-group subjects, it is concluded that children who 
were given the opportunity to participate in musical experiences did not exhibit 
any advantage in their initial understanding of prenumber concepts. It is further 
concluded that the change in understanding of these concepts was brought about, 
at least in part, by the musical experiences encountered during the complete pre- 
school music program. 

One can only speculate about reasons why a formal preschool music program is 
more beneficial to females than to males in enhancing an understanding of 
prenumber concepts. Perhaps society makes it more acceptable for females to ex- 
press themselves through music than males. When females are given the opportun- 
ity to expand and concentrate this expression in a formal music program which 
emphasizes several concepts that are shared with a prenumber program, then their 
understanding of these shared concepts would appear to be enhanced. 


Note 


al p<.05 unless otherwise indicated. 
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Recent attempts to integrate special education students into regular education in- 
volve considerable risk where deaf children are concerned. Although we do not suggest 
mainstreaming be eliminated, unselective application may be counterproductive to 
the goal of effective education. This article suggests a policy of mainstreaming deaf 
students as an alternative placement in a diverse educational system, a system which 
retains schools for the deaf as a part of a wide spectrum of services. 


Many deaf people and some professionals working with deaf students reject the 
practice of mainstreaming. This article will support this argument, not because 
mainstreaming is undesirable, but because as presently practiced, it offers little 
benefit to many deaf students. Such students are not hard of hearing. They are 
those students with little or no residual hearing, whose disability precedes the 
acquisition, at approximately three years of age, of oral speech and oral language. 
Only one in 10 of these students will be born to deaf parents. The large majority 
will have hearing parents—parents dependent for information about deafness on 
professionals who rarely have reasonable understanding and empathy for the 
status, problems, attitudes, and values of an adult deaf person. These professionals, 
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many of whom are not trained as teachers of the hearing impaired, generalize from 
the experiences of other disability groups and of hearing parents of deaf children. 
They ignore or neglect the point of view presented by the deaf community, a point 
of view related to personal experience, but which has a developing base in quantita- 
tive data. 


Oral Approaches to Education 


The use of speech and lip reading is widely advocated as a method of educating 
deaf students. Deaf people themselves value such skills, so it is important to distin- 
guish between oral approaches to deaf education as a philosophy, and the use of 
such approaches to the exclusion of the effective use of sign language systems in 
classrooms and elsewhere. Indeed, Conrad (1979) discusses the threat that sign lan- 
guage can present to some hearing people. He refers, for example, to the “signing 
alternative, which is then usually suggested as a last resort, [and] is also regarded 
as dangerous, almost hostile; teachers feel superstitiously threatened by its use” 
(p. 310). Quotes are provided to support this hypothesis, a particularly revealing 
one being from van Uden (1970)—“building up a primitive own world view, differ- 
ent from ours.” These fears have not only influenced the education of deaf stu- 
dents, but also other areas of special education. For example, discussing the case of 
Nadia, a young autistic girl whose extraordinary artistic skills were destroyed by 
intensive language training, Lyall Watson remarks: 

Our entire educational effort seems to be dedicated to making others over in our image, 
turning out carbon copy people who faithfully reproduce all our mistakes. “Better,” we 
seem to be saying, “a poor imitation of ourselves, than something different, something 
threatening.” An eminent child psychologist sums up Nadia’s case by concluding that, if 
the loss of her gift is “the price that must be paid for language . . . we must, I think, be 
prepared to pay that price on Nadia’s behalf.” ... Must we? I am far from convinced. 
(1979, p. 34) 

Unfortunately, such feelings can easily be exploited. A disturbing discussion of 
the work of Conrad is found in the Volta Review, a journal read by many parents 
of hearing impaired children. In a section entitled “Forum,” an article by Arnold 
(1983) is headlined “Does Pure Oralism cause Atrophy of the Hearing Impaired 
Child’s Brain?” The title is an oversimplificaiton of a complex scientific issue pre- 
sented as a tentative hypothesis. However, the important point is not the scientific 
question, but its editorial presentation in a manner and form which may play upon 
the understandable fears of parents about the consequences of deafness for their 
child. 

Nash and Nash (1981) bring a note of sense and reality to this discussion by 
analyzing the motivation of ardent adherents of oral only education and of social 
intercourse. “Oralism as a formal system emerged to account for the presence of a 
deaf child in a hearing family in a way that would allow the parents to continue 
their understanding of parenthood virtually unchanged (p. 24). At birth or shortly 
thereafter, a contract is entered into that is supportive of middle class values. 
‘Work hard, your child will become verbally fluent [in speech] and, therefore, 
occupationally and socially successful.” The contract has an additional advantage 
for the professional, as it shifts ultimate accountability to the parent. If the student 
is not orally successful, the home environment is the reason. Again Nash and Nash 
express it eloquently: 
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Thus a circle is established that continually attributes blame for lack of success to the 
parents, who, regardless of how hard they work, fail to produce a “speaking” person like 
themselves. In fact, it can even be suggested that the more work done by the parents, the 
more guilty they feel. Within oralism, of course, the process just described dovetails with 
some parents’ ideas about what might have caused deafness in the first place; for ex- 
ample, “We did something to cause this condition,” or “We must have sinned and are 
being punished.” (1981, pp. 24-25) 

The argument comes full circle, as is clearly shown in the study of Bernstein 
and Henderson (1969). When uninformed hearing people are questioned about the 
consequences of deafness, middle class respondents focused on verbal deficiency 
and working class respondents on thinking processes. Consequently, it is possible 
to speculate that the early class backgrounds of professionals in the field of deaf- 
ness might reveal more about the nature of the debate of oral methods of education 
than arguments about speech processes or the linguistics of sign language. 


The motivation to reinforce sociocultural values which emphasize the impor- 
tance of speech and verbal fluency, and occupations which are highly dependent on 
these attributes, can be contrasted with the much greater need for parents and 
families to reorient the dynamics of the family to permit acceptance and integra- 
tion of the deaf child. Harris (1982) has summarized this process as including: 
(a) the parents’ ability to deal with grief and reach mature acceptance of deafness; 
(b) the availability of family crisis intervention support; (c) the availability of a 
supportive social network; and (d) providing the opportunity to make informed 
choices about communication methods and language development. A focus on 
speech development alone frequently prevents families from dealing with these 
other aspects of their relationship with a deaf child. When this happens, the 
long-term consequences are often tragic. 


Ethnicity and Education 


In developing a policy which limits the choice of deaf students to only main- 
stream placements, many hearing people and educational administrators fail to un- 
derstand that deaf people regard themselves as an ethnic group rather than as 
disabled or handicapped. Ethnicity associates with a separate language, American 
Sign Language (ASL), and with the existence of national and local, social, political 
and other networks. However, the deaf ethnic group differs in one important re- 
spect from most other similar groups—membership by birth is rare (as we noted 
earlier only about 10 percent of deaf children are born to deaf parents). Therefore, 
enculturation by parents is also rare, and indeed, in some respects parents may 
have to be enculturated by their adolescent children, or by deaf adults. Erting 
(1978) emphasized some other significant factors of deaf ethnicity: (a) it is the ina- 
bility to speak rather than the inability to hear to which the stigma of deafness at- 
taches itself; (b) only a minority (approximately 12 percent in the U.S.A. and less 
elsewhere) of teachers of hearing impaired students are themselves deaf; and (c) in- 
deed, training programs for teachers of hearing impaired students have 
systematically discriminated against deaf and hard of hearing applicants. The re- 
sulting educational policies prescribe English, in oral or manual form, as the domi- 
nant language of instruction. 

The parallels between the use of English and the rejection of West Indian Cre- 
ole and the rejection of British Sign Language in schools for the deaf in the UK 
have been explored intensively by Ladd and Edwards (1982). The origins for the 
“creolization” of British Sign Language are found in the verbal and nonverbal cuing 
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which identifies for deaf people and others their low social status, and Ladd and 
Edwards draw comparisons between negative attitudes towards deaf and West 
Indian people, and the derogatory attitudes shown towards their languages by 
teachers and educational administrators. One of the unfortunate effects of this sit- 
uation, according to Ladd and Edwards, is the internalization of these negative at- 
titudes by the minority group. Therefore, “West Indians, particularly the educated 
West Indians, are among the severest critics of their own community’s speech. 
Many deny all knowledge of Creole” (p. 112). 


In the orally dominant schools that have been, and are still a characteristic of 
Great Britain, the students of each school develop their own idiosyncratic language 
structures. Teachers mostly communicate only in a pidgin language, usually oral 
but often supplemented or reinforced by signs. More confusion is engendered by 
adding two distinct forms of manually coded English into the British educational 
system—the Paget Gorman Sign System and Signed Systematic English. However, 
at no point have teachers incorporated into the classroom the formal sign language 
of the adult deaf community, which has primarily been retained in the institutes 
(clubs) for adult deaf people. As a result “most deaf people are forced to recreolize 
BSL in every generation” (p. 112). The situation in Canada is similar, but less ex- 
treme. 


There are severe educational consequences resulting from the attitudes 
discussed in the previous paragraphs. Both West Indian and deaf students are se- 
verely retarded educationally, and such retardation results, at least in part, from an 
educational system that is totally predicated on the use of standard English. To 
support their contention, Ladd and Edwards (1982) cite some interesting 
parallels—(i) when using their own language the students are perceived to be less 
intelligent and badly behaved; (ii) the students are described as “nonverbal and in- 
articulate,” but teachers also complain that they never stop talking in their own 
language; (iii) standard English is the model of use and success, and the Creole is 
often banned in the classroom; (iv) when the Creole language is used, attempts are 
made to ensure that it is used “properly” which means that it is distorted to con- 
form to the syntax of spoken English; and (v) comments are made about the “lack 
of spontaneity” in written expression without associating this problem with 
hypercorrection of spontaneous language structures. 

Of particular significance is the effect of the values just described on academic 
learning. Learning becomes the process of correction of incorrectly produced 
speech patterns or manual English, not a process of assimilation and comprehen- 
sion. The constant focus by the teacher on English results in “a child who reads for 
accuracy of pronunciation and not for meaning” (p. 119). The student is not given 
the opportunity to develop his spontaneous understanding of the grammatical rules 
of language. Artificial language structure completely dominates and, as a result, 
language function is lost. This limited language function then reduces the student’s 
ability to benefit from academic and other forms of instruction, and by exposing 
the child to an English language dominated education, mainstreaming may increase 
rather than reduce his/her ability to benefit from formal instruction. 
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Educational Policy and Mainstreaming 

Kaufman, Gottleib, Agard, and Kuric (1975) state: “Mainstreaming refers to the 
temporal, instructional and social integration of exceptional children with normal 
peers. It is based on an ongoing individually determined educational needs assess- 
ment, requiring clarification of responsibility for coordinated planning and pro- 
gramming by regular and special education administrative, instructional and 
support personnel” (p. 40-41). The idea is not new in the education of deaf stu- 
dents. 


Johann Baptist Graser (1766-1841), a Bavarian educator, introduced one of the 
first formal attempts at mainstreaming deaf children into regular schools. In 1821 
he established an experimental school for the deaf within a regular school in 
Bayreuth. Its purpose was the gradual placement of deaf children in regular classes, 
and the teachers were to be trained to teach both deaf and hearing pupils. The con- 
cept was enthusiastically accepted and put into wide practice in many public 
schools throughout Germany. However, it went out of fashion when it was observed 
that deaf children failed to maintain the same progress as their hearing peers 
(Bender, 1981). In Canada, at the turn of the century, an early experiment placed 
deaf students in a classroom with hearing students in a regular industrial training 
school. Unfortunately, we do not know if the experiment was successful as the 
school, the Fredericton Institution for Education of the Deaf and Dumb in New 
Brunswick, closed, and the students were.transferred to the Halifax School for the 
Deaf in Nova Scotia (Rodda, Ellis, & Chaddock, 1983). Other early unsuccessful ex- 
periments are fully reported by McLaughlin (1980) and Pritchard (1963), and Brill 
(1978) describes more recent methods used for the selection and placement of 
mainstreamed students. 


If students are appropriately placed in mainstreamed settings, there can be ad- 
vantageous results, but when mainstreaming reflects a desire to assimilate rather 
than integrate and thereby deny the experience of deafness, the practice can and 
does become counterproductive. “The rationale of any mainstreaming program 
must consider the child first. A child’s communicative needs must be satisfied. The 
child [then] provides the rationale for placement within a setting consisting of 
others with similar communicative preference. The setting is [then] viewed as a 
normal and appropriate environment” (Stewart 1984, p. 101). 


Mainstreaming is often an extension of the values associated with the extreme 
oral position. In particular, it frequently rejects the use of sign language, and even 
when it does not, the mainstream situation provides few opportunities for deaf 
children to use sign language intensively or in a variety of different settings 
(Carver, 1984). A language does not make a culture or a subculture, but it is an im- 
portant aspect of it. Similarly, a culture or subculture does not make a person, but 
it is an important aspect of his/her development. As Barbara Kannapell has said: “I 
believe my language is me. To reject a language is to reject the person himself or 
herself” (1980, pp. 11-12). 


The common North American model for mainstreaming is the Cascade system. 
In Canada, much of the impetus for this system came from Wolf Wolfensberger’s 
classic book, Normalization, published in 1972. Wolfensberger’s main thesis urged 
the acceptance of handicapped people as equals rather than advocating a particular 
educational policy, but it was widely interpreted as supporting mainstreaming. If 
we contrast the diagrammatic representation of the Cascade system with the less 
well-known model developed in Britain by Gunzberg and Gunzberg (1973), an in- 
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teresting difference emerges (see Figures 1 and 2). The Cascade system is invari- 
ably drawn in an administrative or structural mode. Where the student fits in the 
system is of lesser importance than his level of achievement in educational and so- 
cial skills. The regular class merges into residential provision and then into a hospi- 
tal setting. The Gunzberg model is drawn in a developmental and functional mode. 
The two are not, of course, necessarily opposed, but they can be if we assume that 
a hierarchically structured pattern of services enables us to make judgments about 
“the least restrictive environment” required by classic legislation mandating main- 
streaming, such as PL 94-142 in the USA and Bill 82 in the Province of Ontario. 
The least restrictive environment is then defined by the match between students 
and the programs of the institution; by arbitrary rather than predefined criteria. 

Possibly, the influence of the Cascade model has lead many to conclude that 
schools for the deaf are institutions and, as such, are undesirable. Both premises 
are suspect.’ Miller and Gwynne (1973) in their seminal study A Life Apart, con- 
cluded that institutions using a “warehousing” ideology, were based on humanitar- 
ian values; and institutions using a “horticultural” approach were based on liberal 
values; but both provided a mechanism to avoid confrontation with the feelings of 
staff about the perceived inadequacies of the inmates. They suggest that the pro- 
cess of admission or discharge (intake or graduation in schools), places the recipi- 
ent of services in a passive role. The staff regard the students, residents, or inmates 
as throughput, limits to be slotted into a flow diagram. Miller and Gwynne argue 
that a major shift is needed, in which the students become part of the human re- 
sources, which are focused on a work oriented throughput. Instead of focusing on 
helping students or clients to achieve their potential, values should be shifted 
toward outlining specific remedial or educational objectives. The student or client 
becomes an integral part of the process by which these objectives are achieved, not 
a passive recipient of services. Without these changes, 

membership of the institution prescribes the dominant role as physical dependency— 
which in the warehousing setting extends to complete psycho-physical dependency. This, 
on the one hand, preempts the possibility of conflict with other roles and, on the other, 
entails delegation to the institution of many of the decisions that the able-bodied adult 
would have to make for himself. The resulting pattern of behavior is not one that is likely 
to foster the development of a mature personality. (Miller & Gwynne, 1973, p. 160) 
As Miller and Gwynne contend, if neither humanitarian nor liberal values are ap- 
propriate, then possibly programs for deaf students and their parents, should be re- 
placed by alternatives developed by the families themselves and by the deaf 
community. 

This confusion of values is a severe problem, further complicated by the process 
of selection of students for mainstreaming. Bunch (1977) outlines the development 
of a mainstreaming program in British Columbia as follows: “Neither the govern- 
ment nor its critics could suggest acceptable and useful methods of assessing which 
children were suitable candidates for mainstreaming and what degree of support an 
individual mainstreamed child required” (p. 11). In fact, the reliability of the tests 
described by Bunch is too low to permit their use in selecting individuals unless 
one is prepared to accept a very high number of false positives and false negatives. 
In particular, he discusses two of the instruments developed for making these selec- 
tions in deaf education, the Transitional Instrument for Selection of Hearing 
Impaired Students for Integration (Rudy & Nace, 1973); and the Mainstreaming 
Placement Question/Check List (Nix, 1977). Bunch rightly criticizes the weighting 
and scoring of these tests, and comments on areas of assessment that are included. 


145 


Michael Rodda, Carl Grove, and Barbara M. Finch 


Hospital Provision 
or Long Term 
“Treatment” 


Hospital School 
Greater 


Special Residential 


School (varying types) 


Special Day School 


Full-Time Special Class 


B 
Specialized Needs 


Movement of Student 


Combination of Special 
and Regular Class 


Regular Class with Resource Room 


Regular Class with Itinerant 


or School Based Consultants 


Regular Class with 
Advice/Help available “on call” 


Regular Class Placement 


7 Relative Size of Program 


(Not to Scale) 


Figure 1: The Cascade System of Special Education (Modified from Deno, 1968) 
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There are also a number of concerns about the validity of the selection criteria 
used for placing students in mainstream programs, particularly as these criteria 
tend to reinforce existing biases within the educational system. Such reinforcement 
means that the student selected for mainstreaming will be the academically higher 
achieving student, with good language and communication skills, who is socially 
well integrated, of higher intelligence, and who is drawn from a supportive and in- 
tact home environment (Karchmer & Trybus, 1981). Hearing loss will be measured 
as much as any other factor (Ross, 1976), although unpublished research by Rodda, 
Grove, and Denmark question this finding when the sample is limited to severely 
prelingual deaf students. The effect of this selection process is to remove from the 
mainstream of deaf education the less educationally handicapped deaf child, and 
leave behind the more severely handicapped students. These students who could 
not or would not be “normalized” are considered “failures,” which can be devastat- 
ing to their self-concept (Carver, 1984). They are placed in special classes and lose 
contact with the good role models provided by successful deaf children. Their self- 
esteem is likely to diminish further (Greenberg & Doolittle, 1977), and indeed, al- 
though only approximately ten to fifteen percent of all deaf children can be suc- 
cessfully mainstreamed, schools for the deaf and special class placements are 
viewed as places for “mainstream failures,” not as appropriate settings catering to 
the individual needs of many deaf students. The same view is shared by students in 
training as teachers (Curtis, 1985), who seem to be only “marginally committed to 
mainstreaming,” and then only if they are supported by appropriate specialist staff. 

In its anxiety to respond to a clearly articulated demand, the educational sys- 
tem has generally failed to adequately plan the whole spectrum of services and pro- 
grams which are needed if mainstreaming is: (1) to be a successful facet of a total 
system; (2) is to retain and advance the support which students receive from group 
identity and role modeling in the deaf subculture; and (3) to enable integration to 
be a two-way process, hearing students and deaf students integrated with each 
other. If mainstreaming is to be successful, the mainstreamed student should be 
prepared and readied for the rigors of the regular classroom placement (Salend & 
Lutz 1984). “If the necessary classroom design adaptations that optimize student 
performance are not incorporated into the regular classroom milieu, the 
mainstreamed setting can be a highly restrictive environment for the handicapped 
student” (Salend, 1983 p. 56). 

To rectify the defects described in the previous paragraphs will require a funda- 
mental reassessment of the place of deaf culture and, in particular, of sign language 
in the curriculum of the regular school. “Mainstreaming would then place the child 
in the program which best met his communication needs.” (Stewart, 1984, p. 102). 
In particular, deaf students placed in regular schools must have experiences that 
relate realistically to their needs, but mainstreaming as presently practiced, asks 
the student to adopt only oral communication methods, or limits his manual com- 
munication to interaction with only a very small and highly selected group of fellow 
students and staff. It asks the deaf child to accommodate to the hearing culture. Of 
course, all children, hearing or deaf, have to learn to accommodate to the wider so- 
ciety, but accommodation presupposes a similarity of social background and cul- 
tural experiences. Most hearing people and students do not have this empathy for 
and understanding of the deaf student, and rarely is it provided in the mainstream 
setting. The consequences of this lack of understanding and rapport are an increas- 
ing number of mainstream failures. 
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Bilingual Education 


Charrow has undertaken two major studies of deaf students and the use of Eng- 
lish as a foreign language (Charrow & Fletcher, 1974, Charrow, 1976). Using the 
Test of English as a Foreign Language, Charrow and Fletcher concluded English 
seemed to be processed as a second language by deaf students. In a later study, 
Charrow concluded that ASL did not interfere with language acquisition in deaf 
children, but rather their English reflected the particular difficulties faced by them 
in mastering oral language. Ivimey (1976) has similarly concluded that “deafisms” 
are systematic variations of English structure, for example, the use of adverbs or 
adverbial phrases to delineate time with a consequent failure to use verb tense for 
this purpose, and plurality is determined by subject determiners rather than by 
plural inflections. These examples indicate the influence of the spatial attributes of 
ASL on English, and reflect the difficulty the deaf child has acquiring rules based 
on phonetic characteristics, characteristics he/she is unable to hear or adequately 
process using vision. 
In contrast to the bilingual environment of their home, Erting (1978) has de- 
scribed the situation which faces deaf children of deaf parents when they enter a 
school or classroom: 
Typically, these deaf children enter a monolingual classroom in which some variety of 
signed English is the only language used, and the only language manually presented that 
is understood by the teachers ... The educational goal for these children is mastery of the 
signed English variety of PSE (Pidgin Signed English), and their communication is 
judged on the basis of the appropriateness of its grammatical form to English rather than 
on its content. From all this the message is clear: Their home, their family, their commu- 
nity, their language, and they themselves are not valued—they must change to become 
acceptable. (p. 143) 

The situation for deaf children of hearing parents is no different in the classroom, 

but of course such children are in an even more difficult situation since they rarely 

have a well-developed language system in either English or ASL. 


An interesting study of ASL/English bilingualism is that of Prinz and Prinz 
(1981, 1979). These authors studied a hearing female child, Anya, the offspring of a 
deaf mother and a hearing father. They noted that “babbling” occurred in sign lan- 
guage as well as spoken English, and the first sign emerged “several months before 
the first spoken words.” Babbling in both languages was combined at an early stage 
of language development into oral or manual strings, which later associated with 
the development of combinations of oral and manual symbols to allow the child to 
communicate. The observations on the development of a lexical system showed, 
“lexical acquisition in ASL and spoken English progresses through similar stages,” 
and the results of this study seemed to confirm (Volterra & Taeschner, 1978) that 
early bilingualism initially results in the formation of one lexical system into which 
entries are made separately from both languages. The study also confirmed the re- 
sults of Stokes and Menyuk (1975). In the early bilingual acquisition of ASL and 
spoken English, a construct is encoded in one modality or the other, but only rarely 
in both modalities. Later this single lexicon separates; when this separation takes 
place, the use of alternate codes begins, and signs and words exist for the same con- 
cept. At this point, code switching becomes possible and is seen in the communica- 
tive behavior of the child. In the mainstream situation this code switching and the 
use of manual communication will be of no use to the child because of his/her 
placement in a monolingual setting—cognitive strategies and coping skills are ren- 
dered almost entirely useless, but we still persist in regarding unselective 


149 


Michael Rodda, Carl Grove, and Barbara M. Finch 


mainstreaming as a viable alternative for all deaf children without taking into ac- 
count their competence in English. 


Role of Parents in Mainstreaming 


Mainstreaming is an excellent example of the importance of parents as 
determinants of educational policy. In the USA, such demands culminated in 
PL 94-142 mandating the education of handicapped students in the “least restric- 
tive environment.” However, despite recent changes in educational policy, the 
paternalism of professionals towards parents still exists. Schlesinger & Meadow 
(1972) document this phenomenon, and Harris (1983) gives a highly personal ac- 
count. The values parallel paternalism to deaf people and deaf children. Procedural 
protections which exist for parents can reinforce this attitude; often PL 94-142 is 
circumvented. As Garretson (1981) points out, the deaf child spends eight percent 
of his 8,760 hours per year in the classroom, but school systems have effectively in- 
sulated themselves from the full effects of PL 94-142 by defining appropriate as 
meaning “not inappropriate” (Large, 1980). This can lead to a situation where the 
provision by a school district of only one form of mainstreamed education is viewed 
as acceptable, and the parents of deaf children are forced to accept the local pro- 
gram, irrespective of its suitability for their child (Greenberg & Doolittle, 1977). 
Large (1980) crystalizes the dilemma: “Congress and the commentators have 
blithely assumed that regular placements were always more beneficial. Parents and 
teachers, who have to deal with the practical aspects of handicaps after the theore- 
ticians finish congratulating themselves, are not so sure... Parents fear that their 
children will lose an effective education to be part of an ill-advised experiment in 
socialization,” (p. 270) and “Research suggests that the success of the mainstream- 
ing effort is often dependent on the quality of communication and support between 
these two previously distinct educational delivery systems” (Salend, 1984), the local 
school and the central campus or residential school. 


In the education of deaf students, the issue of mainstreaming is complicated by 
the fact that a primary effect of the handicap is to interfere with the communica- 
tion process between deaf and hearing people. In this sense, deafness differs from 
other handicapping conditions. It also differs as deaf culture is stronger and more 
established than other subcultures of handicapped groups, and there is consider- 
able attachment to the subculture on the part of deaf adults. One effective way for 
a deaf student to obtain a positive self-image and to aspire to accepted values is 
through the support and role modeling which is available from deaf adults. This 
situation has long been recognized, yet Higgins (1980, p. 64) discusses the continu- 
ing and pervasive influence of authorities such as Alexander Graham Bell, and his 
considerable concern to prevent deaf people maintaining their subculture. Bell 
opposed residential schools, sign language, school reunions, adult deaf societies, 
and publication of the literature of the deaf community. 


Given the attitude to deafness which characterized education in the past, and 
its replication in many areas of modern practice, it is not surprising that deaf 
people feel threatened by mainstreaming. Many social and political achievements 
of the deaf have come through a concerted and collective action on their part, not 
through the efforts of hearing society. The deaf community is an important and 
vital agent for changing the status of deaf people, and if destroyed, would probably 
regenerate naturally. Successful mainstreaming of deaf students needs the coopera- 
tion of deaf people. They are supportive of its general principles, but also 
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understand how difficult it can be to be “Outsiders in a Hearing World” or to be 
enclosed in a “Plate Glass Prison.” Extreme neo-oralism and mainstreaming can be, 
and often are, the ultimate rejection of deaf experience, culture, and language, and, 
therefore, of deaf students and adults themselves. 


Future Policy and Final Conclusions 


Anderson (1973) offers some prescriptions for future policy on mainstreaming. 
Her study of physically handicapped children receiving education in ordinary pri- 
mary schools in England was exceptionally searching and she concludes that “given 
special provisions . . . it is possible to offer even severely handicapped children a 
satisfactory education in an ordinary school.” She considers, however, that “for cer- 
tain children special schools may always be needed, but as the provision of special 
educational treatment within ordinary schools increases, this need is likely to be 
smaller than at present.” She suggests three main functions for the special schools: 

The first is that they should be used increasingly as observation and assessment centres. 
Secondly they should provide children (from as early an age as possible) with an initial 
period of skilled and intensive training by teachers and therapists working together. . . . 
After this training period, which might for some children last from one to four years, 
many children would be ready for transfer to a special or ordinary class. Ideally the spe- 
cial school would be sited close to an ordinary school so that for at least part of the time 
handicapped and non-handicapped children could be taught in mixed groups. Thirdly, 
special schools could act as advisory centres from which . . . experienced teachers and 
therapists could travel as consultants to teachers in special and ordinary classes who need 
help. (p. 302) 

At present, the large majority of hearing impaired children “track” for the whole 
of their school careers in one type of setting, unless they fail to integrate in adoles- 
cence, in which case they often transfer to a residential or day special school. 
Movement between different settings can often be difficult or impossible, but as 
students’ needs change, a formerly appropriate environment may become inappro- 
priate. Consequently, integration will not occur simply because of proximity, and to 
ignore the problems of the handicapped child in coping with his/her handicap is to 
ignore all we have learned about attitudes and the need to change community val- 
ues if positive attitudes are to be developed. Deaf students can have a special ad- 
vantage over other handicapped groups, the support of and membership in the deaf 
community. By failing to accept the experience of the deaf community, present 
mainstreaming practices are doing little, if anything, to remedy the real deficiencies 
in the education of deaf students. Such students are leaving schools with very poor 
skills in basic competencies, and there is an epidemic of emotional/behavioral prob- 
lems in the school-age deaf population (Schein, 1981). In advancing an uncritical 
commitment to mainstreaming, we are proffering an administrative solution with 
no real base in clinical practice or educational services, and a solution with little 
potential for solving the major problems confronting our educational sytem. 


Note 


il This article grew out of Dr. Rodda’s activities as a McCalla Research Professor in 1983-84. 
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PERSPECTIVES 


Applications of New Directions in Statistics to 
Educational Research 


Thomas O. Maguire 


University of Alberta 


Some recent developments in applied statistics are described and examples of their 
uses given. The data used in the examples were taken from a study involving class- 
room interactions in fourth grade mathematics classes. The utility of stem and leaf 
displays, box and whisker displays and transformations is illustrated. Two develop- 
ments in inferential statistics, the bootstrap technique and randomization tests are 
presented. The first is a strategy for developing confidence intervals in situations 
commonly encountered in education, that is, where no random selection has occurred 
but statistical generalization is desired. The second is useful for testing the 
hypothesis that data show trends that are plausibly random. 


The use of statistics in educational research has a history that covers more than 
50 years. Improvements over that time have usually been in the direction of in- 
creased complexity and mathematical sophistication. Many have been made pos- 
sible through the development of the digital computer. Recently some new 
developments in statistics have occurred which are beginning to attract the atten- 
tion of social science researchers. Unlike previous trends these developments are 
characterized by increased simplicity and direct involvement of the researcher with 
data. 

The principal purpose of the present paper is to describe some recent develop- 
ments in applied statistics and to illustrate their use with data collected in an edu- 
cational setting. Prior to doing this, it is useful to review the current orthodoxy— 
and then to suggest in general how the recent work differs from the traditional in 
terms of philosophy and approach. 


Dr. Maguire is with the Department of Educational Psychology specializing in measurement and research methods. 
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Background 


Conventionally, statistical analysis has been treated in the texts as a prescribed, 
often monolithic activity. Proposers of research are encouraged to specify in ad- 
vance the statistical procedures to be used, and deviations from these, especially 
deviations that occur as a result of seeing the data as they accumulate, are often re- 
ferred to as data snooping and are usually discouraged. 


Analyses are seen to be divisible into two parts: description (presentation and 
summarization) and inference (hypothesis testing and confidence interval construc- 
tion). There is a tendency in most texts to concentrate on the form of the analysis 
at the expense of the purpose of the analysis in the context of the entire research 
enterprise. For example, data presentation is often covered by bar graphs, 
histograms, and frequency polygons, but seldom is any attention given to why a re- 
searcher should use one or other of these procedures, and what would result from 
the examination of data presented in these ways. 


In 1977, Tukey’s Exploratory Data Analysis appeared. It was a compilation of 
many ideas that he and others had developed and it represents a change in the way 
in which we look at data analysis. The thrust of the book is that statistical analysis 
ought to be a creative, reflective activity that contributes both to discovery and 
confirmation. To do so requires approaches that encourage the researcher to adjust 
the analytical strategy to fit the circumstances of the research. In addition re- 
searchers are exhorted to make extensive use of their data, and far from being dis- 
couraged, data snooping is seen as a valued activity. 


According to Tukey and others the field of applied statistics relates to three 
functions of research: exploring, justifying, and archiving. Data exploration involves 
the interaction between the research and the results. As Good (1983) points out, 
the techniques of data exploration should be designed to fit the salient features of 
the data to human cognition. Since patterns are important, data exploration should 
emphasize graphical analysis and de-emphasize formulas and numbers. There 
should be a concern with exceptions, since they often stimulate theoretical develop- 
ment. Concentration on summaries should not be made at the expense of interest- 
ing exceptions. Justification usually involves a process of informed debate (e.g., 
researcher-reviewer). Consequently clarity of presentation, estimation of uncer- 
tainty, accuracy of calculation, and consideration of alternative explanations are 
important characteristics of analytical strategies. Placing data in archives for future 
use by other researchers requires attention to efficiency of storage, simplicity of de- 
posit, and ease of retrieval. 

In the remainder of the paper, applications of recent developments from the 
two broad areas of statistics are presented. From descriptive statistics some ex- 
amples of exploratory data analysis are provided. Edgington’s randomization test 
(1980) and Efron’s bootstrap technique (1982) are two examples of recent work in 
inferential statistics. The data used in all examples were collected by Macdonald 
(1972). 

As a part of his doctoral research, Macdonald observed pupil teacher interac- 
tion behaviors in seven fourth grade mathematic classes. The Flanders interaction 
analysis categories (1970) were used for this purpose, and the six categories of be- 
havior used in this paper are shown in Display 1. Each class was observed for 20 
hours, and teacher behaviors were noted together with the student to whom they 
were directed. Approximately 70% of the interaction could be described as 
occurring between teachers and individual students, and it is this dyadic behavior 
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that is used in the analyses. Data for students who missed more than five classes 
were dropped. The results were prorated for students who missed fewer than five 
classes. 


Data Exploration 


In most studies of this sort the first step in the analysis would be to summarize 
the data by calculating means and standard deviations. These are shown in 
Display 2, as they would be found in the print-out of a typical computer program. 
The large number of decimal places that have been retained makes it difficult to 
see interesting trends. Perhaps a better way to examine the data would be to ap- 
proach summarization in smaller steps and begin by simply organizing the data in 
a systematic fashion. One procedure that can be easily done either by hand or ma- 
chine is the stem and leaf display. In Displays 3 and 4, stem and leaf displays of 
variables for Classes 6 and 7 are shown. 


Display 1 
Flanders Categories Used in Macdonald’s Study 


Teacher Talk 


1. Praises or encourages. Praises or encourages pupil action or behavior. Jokes that re- 
lease the tension but not at the expense of another individual; nodding head or saying “um 
hm” or “go on” are included. 


2. Asks questions. Asking a question about content or procedure, based on teacher ideas, 
with the intent that a pupil will answer. 


3. Lectures or explains. Gives facts or opinions about content or procedures; expressing 
his own ideas, giving his own explanation, or citing an authority other than a pupil. 

4. Criticizes or justifies authority. Statements intended to change pupil behavior from a 
non-acceptable to an acceptable pattern; bawling someone out; stating why the teacher is 
doing what he is doing; extreme self reference. 


Pupil talk 


1. Pupil response. Talk by pupils in response_to teacher. Teacher initiates the contact or 
solicits pupil statement or structures the situation. Freedom to express own ideas is lim- 
ited. 

2. Pupil initiation. Talk by pupils which they initiate. Expressing own ideas; initiating a 
new topic; freedom to develop opinions and a line of thought, like asking thoughtful ques- 
tions; going beyond the existing structure. 


A stem and leaf display is a way of tabulating data that combines the visual 
characteristics of a histogram with the ability to recapture each of the data values. 
The stem is the value that appears to the left of the colon, the leaves appear to the 
right. To recapture an original value you simply attach the leaves to the stem. For 
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Display 2 
Means and Standard Deviations of 
Interaction Variables in 7 Classes 


MEANS 
Class 1 2 3 4 5 6 
il 9.2727 58.1818 14.3636 9.5455 75.5455 27.0909 
2 12.5909 46.7727 Oleni27, 7.8636 38.1364 19.2273 
3 16.0385 47.0385 29.9231 3.9615 45.5769 28.3077 
4 12.7826 42.6957 35.0870 14.0870 71.8261 44.3478 
5 10.8571 39.4643 13.1786 6.2143 42.8571 32.7143 
6 4.6429 27.0000 16.9643 4.2143 18.0000 27.1429 
7 11.8571 62.6786 25.5000 10.75.0000 78.0000 25.6429 
SDEVS 
Class il 2 3 4 5 6 
1 5.7618 25.8345 10.8357 9.6471 22.2929 15.6812 
2 4.1193 23.8116 37.9736 7.7887 22.9749 15.3769 
3 10.4236 42.6294 26.7336 4.4070 38.2111 21.8505 
4 3.9668 24.8394 24.2503 12.2932 31.9233 sayy le} 
5 6.5776 23.5653 15.5404 5.8393 27.0776 18.7880 
6 2.9425 10.1242 9.7266 4.8724 8.0401 18.3978 
7 4.3566 35.6431 16.5497 13.1084 32.7850 18.1348 


Variable 1 Teacher Praise 
Variable 2 Teacher Questions 
Variable 3 Teacher Explains 
Variable 4 Teacher Criticizes 
Variable 5 Pupil Response 
Variable 6 Pupil Initiation 


example, looking at Display 3, teacher questions, Class 6, the first row is 
1: 01457799. This means that one student was asked 10 questions, one was asked 
11, and so forth. The original data for this row were 10, 11, 14, 15, 17, 17, 19, 19. In 
Display 4, for the variables praise and criticize, there are two rows for each stem. 
Leaves from 0 to 4 are in the first row and leaves from 5 to 9 are in the second. For 
comparing the two classes, this tends to spread the scores out a bit and highlights 
the differences in the distributions for these variables more clearly. Of course it 
does make it more difficult to compare the distribution of Praise with that of 
Explains. Portrayals almost always involve judgment, and more than one portrayal 
may be needed to bring out the important characteristics of the distribution. 

As noted earlier, the purpose of the analysis is not to build pictures; rather it is 
to aid in understanding the data. In Displays 3 and 4 there are many things to no- 
tice but the restrictions of space in the present paper limit the discussion to two 
points. First, the stem and leaf displays show a large variation existing both be- 
tween and within classes. Researchers often talk about a fourth grade mathematics 
class as though it were a coherent, unitary entity. Stem and leaf displays quickly 
demonstrate that this simply is not so. For most of the variables, Class 6 exhibits 
much less interaction than Class 7. In particular, the number of questions asked in 
Class 6 ranges from a low of 10 to a high of 50. The median value is about 25. In 
Class 7 the range is from 13 to 205 with a median of 54. The second teacher not 
only asks many more questions than the first teacher, but the distribution of ques- 
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tions to various students is much less consistent. It seems clear that the class envi- 
ronments must be very different. 


A second thing to note is that often data like these (counts, times, etc.) have 
distributions that are peculiar. The fact that they are bounded on one end by an 
absolute zero seems to encourage a trailing off or skewing toward the other. In 
Class 7, one person is asked 205 questions in 20 hours. Such abnormal points can 
be a profound influence on summary statistics, and unless the distribution is exam- 
ined, the problem may go unnoticed. For the example, the correlation over the 28 
students between teacher quetions and teacher explanations is .7]. If you exclude 
the person who was asked 205 questions, the correlation drops to .51. 


Having organized the data into stem and leaf displays, and having made note of 
the shapes of the distribution, the researcher will likely need to pause and reflect 
on the possible reasons why different shapes emerge. Some distributions have dis- 
tinct lumps; others have a uniform spread of values across the range. In each 
stituation, the purpose of the data portrayal is to stimulate questions in the 
researcher’s mind. Some questions may be answered by further analysis while 
others may lead to further research. 


The stem and leaf display retains all of the information in the original data ex- 
cept for the identity of the individuals. After the data have been examined, it is 
usually useful to proceed to some sort of summary. Means and standard deviations 
are attractive when they have one mode and are reasonably symmetric. They give 
the reader a sense of central tendency and of spread. An alternative approach that 
accomplishes the same thing visually is the box and whisker display. Examples for 
Class 7 are shown in Display 5. 


The ends of the boxes are placed at the first and third quartiles. The line 
through the middle of the box is the median. The whiskers join the boxes to the ex- 
treme scores, except in cases where the extreme scores fall more than one and a 
half box lengths beyond the upper or lower quartile. Such scores are marked with 
asterisks and are called outliers. Students whose scores fall at the extremes have 
been labeled. For example, pupil 19 is asked the most questions, makes the most 
responses, and receives more explanations than any other student. 


If the data have been organized into stem and leaf displays, the construction of 
a box and whisker display is a simple matter requiring the researcher to count in to 
the quartiles and the median. When distributions are highly skewed, medians and 
interquartile ranges are usually more useful indices of central tendency and disper- 
sion than means and standard deviations. Whether working by hand or with a com- 
puter they are easy to calculate. The box and whisker display essentially uses five 
numbers to summarize the data: lowest value, first quartile, median, third quartile, 
and highest value. Generally speaking the visual representation of these five 
summary values gives the researcher more insight than tabulating the two 
summary values (mean and standard deviation) such as is done in Display 2. 


Class 7 is characterized by a lot of questioning and answering. These medians 
are much higher than those of the other variables. There is very little praise and 
criticism, and few children initiate interactions. Having said this, it is important to 
note that the class experience is different for different students. For example, 
pupil 19 is in action far more than other children. One wonders what happens on 
the days that pupil 19 stays at home? What is the effect on the teacher’s strategies 
of having a child like this in class? Perhaps something significant could be learned 
about teaching and learning if a case study were conducted in the interaction be- 
tween this pupil and teacher. 
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Steam and Leaf Displays 
for Teacher Questioning and Student Responding 
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Display 4 


Stem and Leaf Displays 
for Four Interaction Variables 


(Stem: tens 
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Display 5 
Box and Whisker Display of Class 7 Interactions 
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Procedures such as stem and leaf displays and box and whisker displays help 
researchers to see some of the most important features of the data such as the gen- 
eral level of the scores, the spread of the scores, and exceptional values. Sometimes 
there are interesting points that are not shown by these portrayals, and data trans- 
formations can be used to bring them out. 

Data transformation is a topic that receives little play in the methodological lit- 
erature, and even less play in research reports. Generally transformations are men- 
tioned in two contexts: linear transformations to standardize scores, and nonlinear 
transformations to satisfy assumptions required by inferential statistics. There are 
two other reasons for transforming data that are just as important. They are trans- 
formation to match our conceptualization of a variable, and transformation to ac- 
centuate regions of a score distribution. In data exploration, transformations are 
often used for one of the latter reasons. 


In examining Macdonald’s (1972) data, some researchers may decide that the 
incremental effect of teacher behavior on students acts in proportion to the amount 
of interaction that has already occurred. For example, if there are two students and 
one is asked 200 questions in 20 hours and the other one is asked 20 questions in 20 
hours, an increase of five questions would likely have far less influence on the first 
student than on the second. It may seem reasonable that a 25% increase has the 
same impact so that five questions for one student is equivalent to 50 for the other. 
If this is so, then an equal interval scale of influence would be the log of the fre- 
quencies rather than raw frequencies. This would be an example of transforming 
data to match a reconceptualization of the variable. 


The other reason for transforming data is to accentuate parts of the distribu- 
tion that do not show up very well. As noted earlier when the data are composed of 
counts they are often skewed. Plotting techniques usually draw attention to the 
upper outliers. If we transform the data to make the distribution more symmetric, 
then the eye is not drawn to the upper end at the expense of the lower end. 


To illustrate, a log transformation was placed on the data from Class 7. The re- 
sults for each variable were then standardized by subtracting the median and di- 
viding by the interquartile range. This made it possible to compare variables. A box 
and whisker display is shown in Display 6. 


In examining Display 6, the reader should realize that two transformations have 
been carried out. First, the data were transformed to logs which meant that propor- 
tionate differences were made into equal intervals. Secondly, the logged data were 
transformed so that the interquartile ranges for each of the variables were made 
equal. The second transformation was done so that the exceptional scores would 
have an equivalent impact. For example, the second lowest score on explanations 
(P20) is twice as far below the median in interquartile units as the lowest on praise 
is from its median (P5). 

Regardless of the units however, the log transformation brings to the reader’s 
attention pupil 20. In some respects this child is as exceptional as pupil 19, except 
that neither the researcher nor the teacher would likely have spotted it. Again a 


case study of this exceptional student might tell us much about what happens in 
classrooms. 


There are many other exploratory strategies that can be used to examine data, 
and the interested reader will find Tukey (1977) an excellent starting point. 


162 


New Directions in Statistics 


Bootstrap Confidence Intervals and Randomization Tests as Approaches to 
Statistical Inference 


Broadly speaking, statistical inference strategies fall into two categories: 
hypothesis testing and confidence interval construction. In many cases it is possible 
to move back and forth between the two in a mathematical sense, but there is a 
conceptual distinction. Hypothesis testing requires the specification of a point 
hypothesis which is the focus of a decision. Usually in the behavioral sciences we 
aim to reject the hypothesis in order to support our theories. Thus hypothesis test- 
ing is logically tied to theory construction. When a researcher constructs confidence 
intervals, the goal is to estimate some characteristics of a population from informa- 
tion collected on a sample. There need be no theory involved at all. 


Campbell and Stanley (1963) distinguished between the internal validity of 
studies and the external validity. Roughly speaking internal validity refers to the 
ability to attribute reasons or causes to particular outcomes. External validity re- 
fers to the ability to generalize results to other groups and other circumstances. 
Bracht and Glass (1968) examined the notion of external validity and broke it 
down into population validity—the ability to generalize results to other people or 
samples, and ecological validity—the ability to generalize results to other circum- 
stances. Inferential statistics is seen by many writers (including Bracht & Glass) 
as working in the service of population validity. The methods of statistical infer- 
ence allow us to make statements about populations based on observations made 
on samples. 

In many of our statistics courses we fail to put inference in its proper 
perspective. Students have difficulty grasping the logic, and there is a tendency to 
accept inferential models uncritically. This leads them to behave as though popula- 
tion validity were entirely a statistical issue. 

Inference is essentially a process of reasoning and debate. The scholarly com- 
munity examines the quality of arguments, and knowledge emerges through a pro- 
cess of critical analysis of evidence and implication. Statistical inference has no 
special status, but it does have particular power to convince when the circum- 
stances of the study are working favorably. Unfortunately, models for statistical in- 
ference reflect the affairs only approximately. The principal operating assumption 
that helps us to go beyond the data is that there is a random sample from some 
population. Given this, a little logic, some algebra, and a good salting of the calcu- 
lus of probability, the inductive leap is made. In practice, the random selection as- 
sumption is almost never met in educational research. Our samples are usually an 
available or interesting group who have either given their consent to be studied, or 
had it given for them. 

Even knowing this, there is something about the consistency of human behavior 
that suggests that the findings from one sample extend beyond the boundaries of 
the study. Extensions are rationalized by saying that the observed data behave as 
though they were a random sample from some hypothetical population. In other 
words, samples represent something beyond themselves, and so tests are per- 
formed, or confidence intervals calculated in the hope that they will give us some 
idea of the uncertainty of the findings. 

Since in most educational research the sample studied is unlikely to be a ran- 
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Display 6 
Class 7 Standardized Log Frequencies 


2- 
*P19 
she 
ae eo + P20 “P20 
*P20 
4 
*P26 


asTeig 
suotisenh 
suot j eueTdxq 
ustOFIFAO 
asuodsey Trdng 
uofyeyapuy Trdng 


164 


New Directions in Statistics 


dom sample of any population, the aim should be to find samples that represent 
populations of interest. They should be well described, and the internal variation 
should cover the most interesting uncertainties in the population. How much an in- 
terpretation is trusted will depend as much on knowledge of the context (i.e., 
knowledge of the entire area being examined) as on the probability of type 1 error. 


Efron’s (1982) bootstrap technique provides a partial answer to the problem of 
nonrandom samples. It can be used to describe the uncertainty that arises in edu- 
cational studies. The notion is simple—create a hypothetical population that is an 
infinite number of replications of the sample that you have, then draw random 
samples of individuals from the hypothetical population and create an empirical 
sampling distribution of the statistic of interest. The empirical random sampling 
distribution gives us an idea of the kind of sample variation that we might expect 
from a population of which our sample is perfectly representative. It gives a con- 
servative estimate of the amount of sample “wobble” in the data. 


Macdonald’s data can be used to show how the bootstrap works. In Display 7, 
the median values for each of the classes on the six variables are shown together 
with the correlations which were calculated over medians between pairs of vari- 
ables. Medians have been used to represent the activity in each class because they 
are insensitive to outlying values, and in many of the classes there were one or two 
exceptional scores that could have influenced a less robust statistic. Correlations 
between -.4 and +.4 have been excluded, and the variables have been clustered to 
make reading easier. 


Since the correlations have been calculated on class medians, they are an 
indication of the relationships that exist among the general level of the variables 
over classes. For example the correlation of —.6 between explanations and pupil ini- 
tiation suggests that classes that have relatively high amounts of pupil initiation 
tend to have relatively low amounts of teacher explanation. Three overlapping clus- 
ters of variables emerge from the table: questions, responses, and criticism; criti- 
cism, praise, and explanation; and explanation and pupil initiation. While the 
correlations are based on only 7 classes, there is a sense that they should be fairly 
robust since 7 medians ought to have less uncertainty than 7 individual student 
scores would have. The bootstrap can be used to estimate the uncertainty. 

To see how the bootstrap works, consider teacher explanations and pupil initia- 
tions. The medians for seven classes have been plotted and around each class a 
cloud of points has been drawn. This is the hypothetical population which is per- 
fectly represented by our sample. In other words, a hypothetical population is 
conceptualized which has an infinite number of members exactly like the medians 
of Class 1, an infinite number of members exactly like the medians of Class 2, and 
so forth. The population would look like Display 8. It is a population that 
Macdonald’s sample exactly represents. To produce an empirical sampling distri- 
bution, 7 classes are now drawn at random from the hypothetical population. 
Practically speaking this is done by randomly taking 7 points with replacement 
from the set of 7 paired values that relate teacher explanations to pupil initiation. 
This process could yield three sample values exactly like Class 1, one like Class 3, 
one like 5, and two that look like 6. The correlation between the teacher explana- 
tion and pupil initiation in this bootstrap sample is —.68. 

This procedure is repeated many times and each time a correlation is calcu- 
lated. In this example 1000 replications were carried out. Macdonald’s 7 classes is 
one sample from the hypothetical population shown in Display 8. 'The correlation 
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in the “population” between explanation and initiation is -.6. just as it is in 
Macdonald’s sample. The distribution of 1000 sample correlations is shown in 
Display 9. 

Ninety-five percent of the observed values fall between -1.00 and .85. This is a 
very wide interval indeed, and it tells the researcher that Macdonald’s sample of 7 
medians contains a fair amount of sampling uncertainty. Indeed, with these data, a 
researcher cannot even be sure that the sign of the relationship would replicate in a 
future study carried out with a similar population. 


The shape of the distribution is very interesting. There are two parts: one in the 
vicinity of -.6 and the other around +.6. This is another example of the sensitivity 
that correlations have to odd points in the data. In Display 8, Classes 7, 3, 6, 1, and 
5 fall on a line that drops to the right. If these are the only classes in a bootstrap 
sample, then the correlation will be about -.87. If Class 2 is included in the 
bootstrap sample, the correlation jumps to about —.99. Taking out Class 2 and put- 
ting in Class 4 changes the correlation to about +.49. The positive bootstrap values 
come from samples that exclude Class 2 and include Class 4, while the negative val- 
ues near —.1 include Class 2 and exclude Class 4. 


This particular example is noteworthy because not only does it illustrate the 
procedure, but it suggests that an inferential technique can be used in data explo- 
ration. In this case we are reminded that the classes are not “well behaved.” Even 
when going to median scores, we see that the notion of a “typical grade 4 math 
class,” is strained. 


Usually statistical inference is thought of as being a servant of external validity, 
and most parametric tests are used in this role. In non parametric statistics the 
hypotheses being tested are often related to issues of internal validity and in that 
way are qualitatively different from those tested by parametric strategies. 
Edgington (1980) and many others point out that exact tests can, in principle, be 
calculated to test the hypothesis that a particular partitioning of scores differs 
from that expected under random allocation. For example, 10 people could be ran- 
domly assigned to each of two groups, a treatment carried out on one of the groups, 
and a post treatment test administered. The post test yields a particular 


Display 7 
Class Medians 
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Display 7 continued 
Correlations of Class Medians 
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partitioning of the scores between the two groups. Using the difference between the 
means to represent the treatment effect, the exact sampling distribution could be 
calculated under the null hypothesis that the observed statistic is merely the result 
of a random partitioning of the 10 scores. In ordinary languge this is evidence rela- 
tive to the question, “Did a treatment effect occur?” or, synonymously “Is the ob- 
served partition substantially unlike that which we would expect through chance 
allocation of people to groups?” 


This is conceptually different from the logic inherent in the t-test (or the confi- 
dence interval), wherein the researcher asks if the observed difference is so big that 
we can reject the null hypothesis of no treatment effect in the population. In the 
former case, the notion of a random sample need not enter the picture. The infer- 
ence is not made about a population of people; it is made about a population of 
possible events. 


Edgington showed that instead of enumerating the chosen statistic for all pos- 
sible partitions, the researcher could evaluate the observed statistic against a large 
sample of values calculated from randomly generated partitions based on a true 
null hypothesis. Even in the case where there is no random assignment of people to 
groups, the randomization test can be used to evaluate an observed difference 
against a model of random allocation. 


Suppose that we were interested in the amount of pupil responding in Classes 6 
and 7. We can see at a glance that there is more responding in Class 7 than in 
Class 6, but is the difference in variability unusual? Or put another way, is the dif- 
ference significantly greater than what we would expect from a random allocation 
of scores to the two groups? 


One measure of variability is the interquartile range. For Class 7 this value is 
36, and for Class 6 it is 13. The difference is 23. Each of the classes has 28 children. 
A sense of the importance of this difference between interquartile ranges can be 
obtained by using some of Edgington’s ideas. Disregarding class membership, the 
56 scores are randomly assigned to two sets, the interquartile ranges from the two 
sets calculated and the resulting difference recorded. This process is repeated many 
times and the distribution of differences produced. The result for 500 replications 
is shown in Display 10. 


The distribution of differences between interquartile distances shows that when 
the null hypothesis is true (i.e., the interquartile ranges are equal) the empirical 
sampling distribution falls between -40 and +40 points. Ninety-five percent of the 
differences between interquartile ranges are less than 28 in absolute value. This 
suggests that the observed difference in interquartile differences (23) is not signifi- 
cantly different from that which we would expect by a random allocation of scores 
to classes. 


There is not likely a statistical test for the significance of the difference be- 
tween interquartile ranges. Indeed, for many statistics that are useful in data ex- 
ploration, standard inferential strategies are unavailable. The randomization or 
permutation test provides the researcher with a conceptual tool for testing 
hypotheses in such circumstances. The hypothesis—that the observed result differs 
systematically from a random allocation of data—often provides a useful base from 
which results can be evaluated. 
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Conclusion 


The field of statistics has not been stagnant. There are many new develop- 
ments, but the most important ones to educational researchers are those that en- 
courage people to custom design their analytical strategies to fit the problems 
under investigation. These are tools to be used to find patterns, to spot important 
exceptions, to add strength to hunches and to confront researchers with the 
inadequacies of their theories. The aim of this paper has been to bring some of 
these developments to the attention of researchers and to encourage them to exam- 
ine both the strategies and the philosophy behind them more carefully. 


Note 


iN Parts of this paper are based on the author’s McCalla Research Professorship Lecture, January 31, 1985. 
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student evaluation as stated in Recommendation 4 (p. 49). If the recommendations 
of this report can filter down through the administration morass to the district and, 
more importantly, the school and teacher level, so that action can be taken to in- 
clude all the players including students, parents, and prospective employers, per- 
haps both the educational and public perception problems associated with student 
evaluation can be approached and solved in a rational and constructive manner 
(see particularly Recommendations 1 and 2). 

I was slightly disappointed in that there were no recommendations directed 
towards teacher training institutions and preservice education of teachers since 
many statements in the report seemed to indicate weaknesses in these areas. How- 
ever, for the strong and positive recommendations that it does contain, this report 
is to be highly recommended and Dr. McLean should be commended! 


David J. Bateson, 
Educational Research Institute of British Columbia 


174 


PREPARATION OF MANUSCRIPTS 


1. All manuscripts must be typewritten, double spaced, and submitted in 
triplicate and are not to exceed 25 pages in length. 

2. An abstract of approximately 100 words in length, typed on a separate page, is 
to be included. 

3. The author’s name and affiliation should appear only on a separate title page 
to ensure anonymity in the reviewing process; the title of the paper should ap- 
pear on the first page of the manuscript as a means of identification. 

4. Tables must be numbered in Arabic numerals with the word “Table’ centered 
and in capital letters, e.g., TABLE 4. The heading of the table is to be centered 
below and typed in capitals. The format of tables should conform to the 
specifications in the APA Publication Manual. 

5. Graphs and charts should be used only if essential. They must be carefully pre- 
pared on separate sheets in India ink, ready for reproduction. Graphs must be 
properly labelled using Arabic numerals, e.g., Figure 3. 

6. Each table or figure should be presented on a separate page. The position of 
tables and graphs should be clearly indicated within the text by inserting at 
the relevant point the phrase (Insert Table 2 here). 

7. Sources should appear in parentheses following the reference, citing the 
author’s name (unless the name appears in the text), the year of publication, 
and page number if appropriate. For direct quotations, the reference should be 
cited and the page number given in brackets before the final punctuation of 
the quotation. The references should be listed alphabetically by authors’ last 
names at the end of the manuscript under the heading, References. 


8. Explanatory notes, numbered consecutively and identified in the text with a 
superscript, may be included under the heading of Notes. They should be 
double spaced and placed at the end of the manuscript immediately preceding 
the References. The citing of references and quotations in the Notes should 
conform to the procedures outlined in No. 7 above. 

9. Spelling shall conform to Webster’s New Collegiate Dictionary, except in the 
case of direct quotations which must conform to the original. Editorial altera- 
tions will be made if necessary. 


10. In matters of style, the third edition of the APA Publication Manual is consid- 
ered definitive. 


The views expressed and the accuracy of the statements made are the responsibil- 
ity of the individual authors. The editor is solely responsible for the editorial 
comments. 


AJER is indexed in the Alberta Education Index, Canadian Education Index, 
Current Contents/Social and Behavioral Sciences, ERIC/Current Index to Jour- 
nals in Education, and Social Science Citation Index; appropriate articles are ab- 
stracted in Cognitive Development Abstracts, Educational Administration Ab- 
stracts, Educational Technology Abstracts, Language Behavior Abstracts, Multi- 
cultural Education Abstracts, Psychological Abstracts, Research into Higher Edu- 
cation Abstracts, School Organization and Management Abstracts, and Sociology 
of Education Abstracts, and reviewed by the Journal for Research in Mathematics 
Education. 


THE ALBERTA JOURNAL O 
EDUCATIONAL RESEARC 


VOLUME XXXII NUMBER 3 SEPTEMBER 15 


PUBLISHED BY 
THE UNIVERSITY OF ALBERTA @ EDMONT 


THE ALBERTA JOURNAL OF EDUCATIONAL RESEARCH 


Published in March, June, September, and December by the 
Faculty of Education, University of Alberta 


A quarterly journal devoted to the dissemination, criticism, interpretation and en- 
couragement of all forms of systematic enquiry into education and fields related to 
or associated with education. 


EDITOR: Walter H. Worth 


CONSULTING EDITORS 


Reno A. Bosetti, 


Alberta Department of Education 


Robert C. Bryce, 
University of Regina 

Robert K. Crocker, 
Memorial University of 
Newfoundland 

Kathy G. Dueck, 
University of Calgary 

Michael G. Fullan, 
Ontario Institute for Studies 
in Education 

Myrna L. Greene, 
University of Lethbridge 

Geoffrey B. Isherwood, 
McGill University 

Joanne C. Keselman, 
University of Manitoba 


Terry R. Morrison, 

Athabasca University 
Alan R. Pence, 

University of Victoria 
Bikkar S. Randhawa, 

University of Saskatchewan 
C. Jim Richardson, 

University of New Brunswick 
W. Todd Rogers, 

University of British Columbia 
Jaap Tuinman, 

Simon Fraser University 
Donald J. Willower, 

Pennsylvania State University 
John D. Wilson, 

Moray House College of Education 


FACULTY ADVISORY COMMITTEE 


Thomas O. Maguire Erwin Miklos (Chair) 


Alton T. Olson Daiyo Sawada 


AJER gratefully acknowledges support from the Social Sciences and Humanities 
Research Council of Canada and from the Alberta Advisory Committee for Educa- 
tional Studies. 


The subscription rate is $24.00 per year for individuals, $32.00 per year for institu- 
tions. Add $7.00 for delivery outside of Canada. Single copies are $7.00 each. Please 
make cheques payable to The Alberta Journal of Educational Research. All back 
issues are available; rates supplied on request. Claims for undelivered copies must 
be received within three months of publication. 


Address all communications and manuscript submissions to The Alberta Journal 
of Educational Research, Faculty of Education, Publication Services, 4-116 Educa- 
tion North, University of Alberta, Edmonton, Alberta, Canada, T6G 2G5. 


SECOND CLASS MAIL REGISTRATION NUMBER 1436 


ISSN 0002-4805 


The Alberta Journal 
of 
Educational Research 


Volume XXXII, Number 3 September, 1986 
CONTENTS 

THE EDITOR’S PAGE: Research Impotence: Blessing or Blight?...................... 175 

LOGO and the Transfer of 

PiOvlem solving eA Na MpIricAl.| est avait, ca + Gest aee, St, Ae ee 176 


J. Mitterer and L. Rose-Krasnor 


Pactoraelated.to Cursive: WiTitine Skillet 4... oceista. choc ch ee ae ee eee 195 
H. Ratzlaff and D. Armitage 


Two Teachers’ Perspectives on Social 
Sridies Leacning in Prencm Immersion: ..6,..,.-.:0..5)82 9. ae eee 201 
Y.T.M. Mahé 


Critical Incidents in the Supervision of 

Student Teaching: A Cluster Analysis of the 

Perceptions.ol.araduate.. ranster Students) .6 .- . 48) ede CO ee eee 212 
W.B. Boldt and B. Housego 


Precocious Readers: A Comparative Study of 

Early Reading and Non Early Reading 

PCat ater, FUN th ANTS ere ce fe ee Aan’ aio m casi duh Ce 8 Gi 8 eae TE ae 223 
J. Grant and C. Brown 


Selected Job Stressors, Emotional Exhaustion, 

Job Satisfaction, and Thrust Behavior of the 

Pe MS CHOOLETINCIDAl oe aad eh oe eck hs AOR POs S45 EA Jha ne ae Oe ee nee 234 
R.B. Kottkamp and A.L. Travlos 


PERSPECTIVES 


Making Research Matter! irvins ase saad iii hae ee od ee ae 249 
E.A. Holdaway 


FACULTY OF EDUCATION 
The University of Alberta 


Asy 2 itty oo Wijek =. 5 


ib? 


Uses a nese nv uP ™ 4 cis 


RR 


a, 
ey! 


. 
| 4 cea 
i : i eee 
4 J av oe hie “a! 
+ bei Gew 
fi y \ 


« 
i ¢ ew te 
= 
s: 
? 
Aye & 
' “ 
7a 
7 gar (Tie 5 We Avs 
> » 7 ‘het? _ 
se” il ae io a 
if , ae 
; ‘ore » 7 a 
| i - wa j . : 4 . 
in| rare 7 
= 


" | [ 
A ai... ee pee 


top « 


| re 


The Alberta Journal of Educational Research Vol. XXXII, No. 3, September 1986, 175-175 


THE EDITOR’S PAGE 


Research Impotence: Blessing or Blight? 


Holdaway’s article in the Perspectives section of this issue reminds us that 
much of the research in education has not been a potent source of enlightenment or 
change. This impotence may be more of a blessing than a blight! 


Many research studies are “single-shot” efforts by students seeking a graduate 
degree, or professors pursuing career advancement. These ventures tend to focus on 
“targets of opportunity” based on the researchers’ idiosyncratic perceptions of sig- 
nificant educational problems, available time, funds, and talent, and access to data. 
The conceptual frameworks which guide the investigations are not readily appar- 
ent. Data collection is usually emphasized more than data analysis. Contributions 
to theory are rarely assessed. Suggestions for further research, when offered, are 
seldom followed up either by the authors or the readers. 

The result is a “shotgun effect”—a scattering of fragmented information about 
inconsequential matters. This kind of “buckshot” is such a poor guide to action in 
the classroom or the boardroom that we ought to feel relief, rather than regret, that 
it is so seldom used. Otherwise, researchers might be the scapegoats of our educa- 
tional system and educational research might enjoy even less support than it does 
now. 

Is this too cynical and harsh an indictment? Probably. But it does suggest the 
desirability of some modification of current research patterns and traditions. 

One place to start is with summary or interpretive investigations designed pri- 
marily for the scholarly community. These would analyze the rationales of recent 
research studies about various phenomena and seek to identify ways of overcoming 
their conceptual weaknesses while building on their strengths. Such analyses could 
facilitate more meaningful research activities rooted in carefully examined and 
elaborated conceptual frameworks and planned in relation to the larger educational 
issues they may potentially influence. 

Another is to move beyond superficial triangulation in multimethod research 
toward interrelating the analysis of quantitative and qualitative data in ways which 
will increase the trustworthiness of research findings. A 1986 Sage publication, 
Linking Data by Fielding and Fielding, is illustrative of attempts to achieve such 
meaningful convergence. 

Still another starting point is for more groups of professors in education to en- 
gage in thematic longer-term collaborative research and to encourage graduate stu- 
dents to relate their theses to these undertakings. This practice is as uncommon in 
education as it is common in many other disciplines. The outcome could be a criti- 
cal mass of research evidence which would inform policy and practice to replace the 
many snippets of unrelated information currently being produced. 

Articles reflecting any or all of these approaches would be welcomed, as would 
one which challenges the views expressed herein. 


WHW 
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LOGO and the Transfer 
of Problem Solving: 
An Empirical ‘Test! 


A study was conducted to investigate the possibility that learning to program in 
LOGO would result in an improved ability to solve other noncomputer problems. 
Ninety-six grade six children were divided into four groups (LOGO, BASIC, problem 
solving, and classroom control). All children were tested on a variety of problem 
solving tasks before and after training. Experimental groups were given 40 hours of 
small-group instruction and the children were observed for task-relevant behaviors. 
These process-oriented observations indicated groups differed in the quality of task- 
oriented behaviors. Very little LOGO-specific change in problem solving behaviors 
occurred, however, leading to the conclusion that learning LOGO will not enhance 
general problem solving skills, at least under the conditions of the current study. 
Based on these results, it is also concluded that much more theoretical and empirical 
work will be required before it will be clear just what conditions (if any) will foster the 
transfer of problem solving skills. 


One of the most influential positions concerning how to use computers in 
schools is that taken by Papert (1980), and his colleagues (Abelson & DiSessa, 
1981). Papert offers a vision of the use of computers in schools in which “the child 
programs the computer and, in doing so, both acquires a sense of mastery over a 
piece of the most modern and powerful technology and establishes an intimate con- 
tact with some of the deepest ideas from science, from mathematics, and from the 
art of intellectual model building”(p. 5). This vision, as embodied in the computing 
language LOGO, has captured considerable attention in the educational commu- 
nity. Among the features of Papert’s work which have aroused the greatest excite- 
ment have been his claims about “powerful ideas.” These ideas about thinking are 
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encountered while programming LOGO and then become available for use in other 
intellectual endeavors: “What matters most is that by growing up with a few very 
powerful theorems, one comes to appreciate how certain ideas can be used as tools 
to think with over a lifetime” (p. 76). 


The educational implications embodied in such claims are far-reaching, 
mirroring advantages promised in previous years by those in favor of teaching 
Latin, mathematics, logic, and writing systems for the improvement of general cog- 
nitive skills (Pea & Kurland, 1984). For this reason alone, it is important to be con- 
fident that such claims rest on solid foundations. Unfortunately, although Papert’s 
ideas possess considerable intuitive appeal, they are not yet fully articulated and, 
perhaps as a consequence, are not yet well supported either by theoretical argu- 
ments or empirical data. This fact, which has led Pea and Kurland (1984) to label 
Papert’s ideas “technoromanticism,” has recently led to the emergence of a back- 
lash against the earlier enthusiasm with which LOGO was greeted (e.g., Leousis, 
1985), including a call for a moratorium on work with LOGO (Tetenbaum & 
Mulkeen, 1984). However, it is probably too early to render definitive judgment on 
such an important position and, as such, it will be more useful to concentrate on 
articulating the theoretical and empirical issues involved. 


Krasnor and Mitterer (1984) and Gallini (1985) have pointed out the relevance 
of the extensive literature on the transfer of cognitive skills from one domain to an- 
other. In essence, Papert (1980) may be interpreted as suggesting that program- 
ming is a form of problem solving, that “powerful ideas” are general problem 
solving heuristics which can be learned by learning to program in LOGO, and that 
those powerful ideas can be transferred across different types of problems, includ- 
ing those not involving computers. As Krasnor and Mitterer (1984) have shown, the 
considerable literature concerning problem solving leans to the view that there is 
no expertise without considerable domain-specific knowledge, and that expertise in 
one domain does not guarantee expertise in other domains (e.g., Simon, 1980). In 
this view, it is quite possible that learning to program in LOGO will not foster ex- 
pertise in other domains. Further, this literature suggests that if general problem 
solving heuristics are acquired, they tend to be “weak” rather than “strong”; that is, 
they will generally not be sufficient by themselves for problem solution. Thus even 
if learning LOGO does result in the appreciation of some powerful ideas, this does 
not guarantee the manifestation of enhanced general problem solving ability. 

Not only do the above arguments reveal little strong theoretical support for 
Papert’s (1980) position, empirical support is still limited as well (see Krasnor & 
Mitterer, 1984; Pea & Kurland, 1983 for reviews). Krasnor and Mitterer pointed 
out that most of the evidence available at that time was inconclusive and of an an- 
ecdotal nature. Early studies generally suffered from methodological shortcomings 
such as small sample sizes and/or incomplete designs. While subsequent research 
on LOGO has generally used improved methods, what little has been directed at 
the transfer problem has yielded mixed results. Pea and Hawkins (1984) found that 
learning to program in LOGO did not improve children’s ability to plan how to 
carry out a sequence of chores (see also Pea, 1982). Gorman, Jr. & Bourne (1983) 
showed that children given an extra hour of individual computer time made signifi- 
cantly fewer errors to criterion on a set of rule-learning problems than did a group 
given only one half hour extra computer time. Clements and Gullo (1984) found 
that six-year-olds showed improved reflectiveness, divergent thinking, ability to de- 
scribe directions, and metacognitive skills as a function of learning LOGO. 
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In summary, while the position adopted by Papert (1980) is an appealing one, it 
is not yet well supported by either theoretical arguments or empirical data. The 
current study was designed to gather evidence relevant to the claim that teaching 
children to program in LOGO can enhance their ability to solve problems in other 
domains. It implements several features designed to overcome earlier 
methodological shortcomings, including a larger sample size, random assignment, a 
complete design including controls for learning programming and problem solving 
per se, and a variety of problem solving tasks. Further, this study was deliberately 
designed to be implemented within the resource limits currently placed on most 
schools. As such, the results can inform teachers interested in implementing 
Papert’s ideas in today’s classrooms. 


Method 


The main goal of the current study was to investigate the effects of learning 
LOGO on a variety of social and cognitive tasks. The scope of the investigation was 
limited to a single age group of 11-12-year-olds and used a “standard” play-oriented 
version of the LOGO curriculum as described in most published accounts (e.g., 
Papert, 1980). This age group was selected on the grounds that children who are at 
the stage of formal operations could best benefit from this instruction (Krasnor & 
Mitterer, 1984). The effects of LOGO were contrasted with two alternative teaching 
approaches (one using a computer and one without a computer) and a classroom 
control condition, using classic problem solving tasks. 


One computer-oriented group learned LOGO; the other computer group learned 
BASIC. A comparison of the BASIC and LOGO groups allows us to detect differ- 
ences between the effects of learning a structured language (LOGO) and a 
nonstructured, linear language (BASIC), while controlling for computer exposure. 
The noncomputer training group received a modification of a well-known problem 
solving curriculum (the Productive Thinking program). This group was included to 
help clarify the effects of training which was specifically directed at teaching prob- 
lem solving without any computer experience. (All children participating in the two 
noncomputer groups were given instruction in LOGO after the posttesting in order 
to be fair to all subjects.) 


Theoretical analysis of the nature of programming tasks and anecdotal studies 
led us to predict that the problem solving benefits of learning to program should be 
apparent in-the following problem solving dimensions: (a) efficiency; (b) flexibility 
in thinking; (c) integration of task-relevant variables; (d) planning and task- 
orientation; and (e) socioemotional aspects, including attitudinal and motivational 
components. 


Subjects 


Four grade six classes in four different elementary schools were involved in the 
study. Children who were eligible to participate if they and their parents gave per- 
mission, were between 11 and 12 years of age and had minimal prior programming 
experience. From those students meeting the criteria, 24 subjects (12 girls and 12 
boys) were randomly selected from each class. Within each class, children were ran- 
domly assigned to one of four training conditions: LOGO programming, BASIC 
programming, noncomputer problem solving, and classroom control. Across the 


schools, therefore, there were 24 children (12 girls and 12 boys) in each of the four 
conditions. 
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The Peabody Picture Vocabulary Test (PPVT) was individually administered 
prior to the training sessions and provided an index of receptive vocabulary. The 
four groups did not differ significantly on PPVT scores. 


Measures 


Pre- and posttest measures were given during the two-week period which pre- 
ceded and followed the training sessions. All tests were administered by trained 
testers who were unaware of individual group assignment. 


Measures of Problem Solving Efficiency 


Efficiency in solving two types of problems was assessed by pre- and 
posttraining tasks. A block design task was selected to assess the child’s ability to 
analyze visual patterns. Four block design problems (two 4- and two 5-block pat- 
terns from the Weschler Intelligence Scale for Children—Revised [WISC-R] test) 
were individually administered as a pretest measure; four parallel designs were sim- 
ilarly presented as a posttest assessment. The number of seconds to solution (maxi- 
mum 300 seconds) and number of block placements were recorded as indices of 
problem solving efficiency. If the design was not completed after five minutes, the 
tester offered hints until the child was successful. 


The child’s ability to efficiently solve the Tower of Hanoi problem (Simon, 
1975) was also assessed during pre- and posttest periods. This classic task can be 
recursively solved and taps the child’s skill in recognizing repetitive patterns. The 
four- and five-disk versions were individually presented at pretest and the four- 
and five-car versions of a Tower isomorph (the parking lot problem) were present- 
ed at posttest. The sequence of moves made by the children was recorded by the 
tester and later arranged in problem-behavior graphs (Newell & Simon, 1972). The 
child’s efficiency of means-end planning was assessed by the number of moves to 
solution. Two indices of looping structure (tendency to reenter the same problem 
state again) were derived: (a) the average size of the loops (larger numbers reflect 
greater deviation from the optimally efficient solution path): and (b) average num- 
ber of loops which is also inversely related to solution efficiency. 


Measures of Problem Solving Flexibility 

Problem solving flexibility was measured by two creativity subtests from 
Wallach and Kogan (1965). Four different designs were presented to each child at 
pre- and posttesting periods, and children were asked to generate alternate inter- 
pretations. Similarly, four common objects were presented at the pretest period 
and children were asked to generate alternate uses; the procedure was repeated for 
four different objects at posttesting. Protocols were scored for fluency (number of 
responses) and novelty (uniqueness) as described by Wallach and Kogan. 


Measures of Problem Solving Integration 

The child’s ability to integrate two variables in problem solving was assessed 
using two Piagetian proportionality tasks adapted from Siegler (1981). Both tasks 
were group-administered and scored using Siegler’s rule assessment procedure in 
which the child’s answers are classified according to Piagetian stage. Four different 
stage rules were determined. The first task was a balance scale problem in which 
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six presentations showed a balance with weights at different distances from the ful- 
crum. The children were asked to predict which side of the balance would go down 
if a lever holding the balance in place was removed and to indicate their prediction 
by circling the chosen alternative on an answer sheet. 


The second proportionality task was based on probability judgments. Six prob- 
lems depicting sets made up of different proportions of white and red marbles were 
presented. For each problem, the marbles were arranged in two groups, and the 
children were asked to indicate which group they would want to choose from if 
they wanted a specified color of marble and had to pick with their eyes closed. 
Again, answers were recorded on an answer sheet. 


Measures of Socioemotional Aspects of Problem Solving, Planning, and 
Task-Involvement 


Measurement of social and emotional components of problem solving included 
self-report, sociometric, and observational assessments. 


Harter’s Perceived Competence Scale (1981) was used to assess the child’s per- 
ceived effectance and mastery motivation in cognitive tasks. This group- 
administered measure consists of 28 items and produces scores in five effectance 
dimensions: (a) preference for challenge, (b) curiosity, (c) mastery, (d) judgment, 
and (e) criteria source. High scores on these scales indicate an internal, mastery- 
oriented perspective. Children indicated their responses on a four-point scale in a 
structured alternatives format. The competence scale was given pre- and 
posttraining. 

A computer attitude scale for children (Williams, Coulombe, & Lievrouw, 1983) 
was used to detect potential changes in attitude during the training period. This 
scale consists of semantic differential items on seven-point scales representing the 
dimensions of (a) evaluation, (b) quality, (c) ease of use, and (d) expertness. 


The social status of participating children was also monitored to assess changes 
which may have emerged due to participation in the training conditions. The 
children’s peer reputations were assessed using the Pupil Evaluation Inventory 
(PEI) developed by Pekarik, Prinz, Liebert, Weintraub, and Neale (1976). On this 
measure, children were asked to indicate which classmates fit a set of behaviorally 
oriented descriptions. Factor scores were standardized within each class at pre- and 
posttesting. The PEI yields factor scores for aggression, likeability, and withdrawal. 


The children’s social, affective, and task-oriented behaviors were recorded dur- 
ing the tutoring sessions for the three training conditions (LOGO, BASIC, problem 
solving). Using a time sampling technique, observers focused on each child in 
randomized order for a 10 second period and then categorized the child’s behavior 
on the following dimensions: (a) task involvement (on, off); (b) affect (positive, 
negative, neutral); (c) social partner (peer, tutor, peer and tutor, none); 
(d) information (asks for, receives, gives); (e) attention (asks for, gives to other, re- 
ceives from other); (f) action (directs others, complies with other’s directive, resists 
other’s directive, imitates other’s actions, other); (g) overt planning (planning 
verbalization such as, “Let’s do it this way”, “We need to get some sticky stuff to 
hold these together”); and (h) social involvement (solitary or independent activity, 
equal collaboration, dominant). Category definitions are available from the authors 
on request. 

Observations were made for 20 minutes per session, three days per week, over 
the seven weeks of training. Three observations per child per session were 


180 


LOGO and the Transfer of Problem Solving 


collected, totaling a maximum of 54 time samples for each child. Absences and 
school closures resulted in individual variation in the number of observations. The 
actual number of time samples obtained averaged 50.8. 


Reliability between the two observers was assessed for all 18 children observed 
in one day per week for 14 weeks. Observers made independent categorizations of 
each child’s behavior. Percentage agreement between the observed ranged from 
69.87% to 100% across categories, averaging 91.9%. 


Procedure 


The training was conducted in two waves, with two schools participating from 
January to March and the remaining two schools from March to May. In both 
waves, the pre- and posttest measures were given in the two weeks preceding and 
following the seven-week training session. 


In the three training conditions, children were tutored in groups of six which 
met each day for a 40-minute period for seven weeks. The children were allowed to 
choose a partner. Each pair worked together on their own computer or materials, 
which gave each child 40 minutes of joint experience. A tutor attempted to ensure 
that each child within a pair had approximately equal computer or materials con- 
tact during a session. One day each week, each child was given 20 minutes alone on 
the computer or materials, while the child’s partner worked with the tutor on an- 
other training task. Tutors encouraged cooperative work with discovery-oriented 
processes in all training groups. 

The tutors surpervising each group were trained elementary school teachers 
with programing experience. Each tutor taught all three curricula equally and ro- 
tated among the training groups in order to balance teacher-specific effects. The 
tutors, observers, and supervising researchers met weekly to discuss problems and 
coordinate the tutor’s transition between groups. 

As much as possible, the LOGO, BASIC, and problem solving curricula were de- 
signed to present similar concepts and provide for equal amounts of peer interac- 
tion and individual activity. (Copies of each curriculum can be obtained on request 
from the researchers.) The LOGO curriculum was adapted from Watt (1983) and 
Abelson (1981). The BASIC curriculum was drawn heavily from Carlson (1982). 
The problem solving curriculum was adapted from the Productive Thinking pro- 
gram (Covington, Olton, & Crutchfield, 1974). 


Results 


All pre- to posttraining changes were assessed using an ANCOVA for posttest 
scores in which pretest performances were used as covariates. 


Problem Solving Efficiency 

Efficiency in problem solving was measured by the number of block placements 
and time to completion on a set of block design tasks. The children’s scores were 
summed across the four pretest problems and the four posttest designs. The aver- 
age number of block placements was 65.78 for the pretest and 58.32 for the 
posttest. The ANCOVA for posttest indicated no significant differences among the 
four experimental groups (group means and standard deviations [SDs] are present- 
ed in Table 1). Similarly, the ANCOVA for the number of seconds to completion 
indicated that experimental groups did not differ on the posttest. Across all groups, 
X — 465.65 seconds on the pretest designs and X= 460.99 seconds on the posttest. 
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TABLE 1 


Modified WIPSI Block Design 


Group 
Scale LOGO Basic Problem Control 
Solving 
Moves 
Pre 66.7(18.7) 63.2(17.9) 68.0(18.1) 65.3(19.6) 
Post 55.4(16.1) 53.5(17.0) 63.0(19.8) 61.4(19.9) 
Speed (sec.) 
Pre 460(153) 452(143) 472(150) 479(130) 
Post 426(153) 431(198) 510(173) 479(144) 


Problem solving effectiveness was also measured on the Tower of Hanoi task by 
three indices: (a) number of moves; (b) intersection points; and (c) moves per loop. 
For all indices, scores were summed across the four-and five-item versions. The 
average number of moves on the pretest Tower version was 122.28 across all four 
groups; X=109.23 for the parking lot version given at posttest (group means and 
SDs are presented in Table 2 for all three indices). Experimental and control 
groups were not significantly different at posttest, as indicated by the ANCOVA. In 
addition, groups did not differ on intersection points, although there was an appar- 
ent overall difference between pretest performance on the Tower (X=25.61) and 
posttest performance on the parking lot problem (X =13.50). 


Moves per loop were also calculated as an inverse index of efficiency. Across all 
four groups, pretest scores averaged 11.82 moves per loop and posttest scores aver- 
aged 11.10. The ANCOVA indicated a significant difference among groups at 
posttest, F'(3,87)=3.15, p<.03. A comparison of posttest means adjusted for pretest 
performance indicated that the classroom control group (X=14.49) and LOGO 
group (X=12.09) scores exceeded those shown by the BASIC (X=8.58) and prob- 
lem solving (X=9.23) groups. 


Problem Solving Flexibility 


Flexibility was measured by fluency (number of responses) and novelty (num- 
ber of unique responses) for both the patterns and alternate uses tasks. In pre- and 
posttests, the children’s scores were summed across four items. The group means 
and SDs of the creativity tests are presented in Table 3. Fluency scores on the pat- 
tern pretest averaged 12.96 on the pretest and 11.23 on the posttest; for the novelty 
index, Xs=2.90 and 2.35 for pre- and posttests respectively. The ANCOVAs indi- 
cated no significant group differences for novelty, but fluency posttest scores 
among groups were significantly different, F(3.87)=3.15, p<.013. The adjusted 
posttest means indicated that the problem solving training group showed the 
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TABLE 2 


Tower of Hanoi 


Group 
Scale LOGO Basic Problem Control 
Solving 
Moves 
Pre 131(45.7) 114(389.0) 116(48.0) 128(48.2) 
Post 104(36.0) 118(35.9) 106(30.5) 116(45.4) 
Intersects 
Pre 28.9(23.5) 21.1(16.5) 23.1(17.0) 29.3(29.8) 
Post 5 (1326) 13.5(16.5) 12.0(12.8) 17.0(20.0) 
Moves/Loop 
Pre Teer (12.3)) 9.18(6.16) IX AKO.) 12.9(11.8) 
Post 2413.3) 8.13(5.94) 9.18(7.385) 14.7(10.5) 


greatest fluency (X=12.74) followed by the BASIC group (K=11.42). The LOGO 
and control groups showed the least fluency (X=10.06 and 10.70 respectively) on 
the pattern task. 

Across all groups, fluency scores for the alternate uses test averaged 17.10 
responses on the alternate uses pretest and 13.11 on the posttest. Novelty scores on 
this task averaged 1.55 and 1.30 for the pre- and posttest respectively. No signifi- 
cant differences among groups were found on either measure. 


Measures of Integration of Task-Relevant Variables 

The child’s ability to integrate the influence of two variables in a compensatory 
relationship was assessed with two Piagetian tasks. At formal operations, the child 
should be able to understand the relationship between distance from the fulcrum 
and weight in predicting how a balance will respond. Similarly, successful solution 
of the probability tasks requires the integration of the relative number of two dif- 
ferent colored marbles in two arrays. Using Siegler’s (1981) presentation and scor- 
ing criteria, each child’s pattern of responses was categorized as guided by one of 
four rules. Of the 96 children tested, the responses of 11.5% could not be classified 
as indicating a consistent rule on the balance problem and 5.2% could not be clas- 
sified by rule on pre- or posttesting on the probability problem. 

In order to assess changes in integration skill due to training, the child’s level 
(rule) on each task on the pretest was subtracted from the level shown on the 
posttest. The distribution of children who decreased one level, stayed the same, in- 
creased one level, and increased two levels is shown in Table 4 for both the balance 
and probability problems. A chi-square analysis for the balance problem indicated 
no significant relationship, however, between training group experience and pre- to 
posttest change. The data shown in Table 4 indicate the lack of variation in 
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TABLE 3 
Creativity Test 
Group 
Scale LOGO Basic Problem Control 
Solving 

Alternate Uses: 

Fluency 

Pre 17.7(8.8) 17.5(8.0) 16.4(3.4) 16.9(4.8) 

Post 13.0(5.8) 13.5(4.4) 12.8(3.9) 13.1(6.9) 
Alternate Uses: 

Novelty 

Pre 1.75(1.8) Poa0Ga) LiL7Ch a) 1.78(2.2) 

Post 0.88(.95) TAs) 1.46(1.5) L627) 
Patterns: 

Fluency 

Pre 12-3(6-5) 12.8(6.6) 11.3(4.2) 15.6(16.8) 

Post 9,92(5.5) 11.4(4.1) 12.4(5.6) 11.3(9.2) 
Patterns: 

Novelty 

Pre 263824) 2.88(3.2) 1.83(1.4) 4.25(8.3) 

Post 1.96(2.0) 2.71(2.6) 2:25 (2:6) 2.46(4.6) 


children’s scores on the probability problem. Of those who could be classified by 
level, all children used rule two at both testing periods. Thus statistical testing 
would have been meaningless. 


Measures of Socioemotional Aspects of Problem Solving 


The children’s attitudes toward their own effectance in classroom performance 
were assessed using the Harter Scales (1981). The higher the child’s score on these 
scales, the greater the internality of responses. Four factor scores for each child 
were calculated at pre- and posttest and are displayed in Table 5. Across all four 
groups, pre- and posttest scores for the challenge subscale averaged 3.08 and 2.97 
respectively, 3.14 and 2.97 for the curiosity subscale, 2.87 and 2.96 for the mastery 
subscale, 2.28 and 2.49 for the judgment subscale, and 2.71 and 2.95 for the criteria 
subscale. ANCOVAs for posttest scores indicated groups were not significantly dif- 
ferent on any of the five factors. 

Potential changes in the children’s attitudes towards computers were assessed 
using a self-report scale with four factor dimensions. Overall pretest scores for the 
evaluation factor averaged 1.34 on a seven-point scale in which lower scores repre- 
sent the more positive pole. Posttest scores averaged 2.84. Similarly, pre- and 


184 


LOGO and the Transfer of Problem Solving 


TABLE 4 


Piagetian Tasks 


A) Balance Beam 


Group 
Level LOGO Basic Problem Control 
(Post-Pre) Solving 
= 2 2 2 2 
0 14 14 10 6 
i 7 4 8 13 
2 0 1 0 @) 
? 1 3 4 3 
B) Probability 
Group 
Level LOGO Basic Problem Control 
(Post-Pre) Solving 
4. 0 0 0 0 
0 Dis 24 21 28 
E 0) 0 0) 0 
2 0 0 0 0 
? ib 0 3 1 


posttest means for the quality dimensions were 2.03 and 3.18 respectively; 3.44 and 
3.64 for ease of use, and 2.37 and 3.33 for expertness of the computer. ANCOVAs 
for the four posttest factor scores indicated no significant difference among experi- 
mental conditions in attitude towards computers. Group means and SDs are pre- 
sented in Table 6. 

The social status of children in participating classrooms was assessed by peer 
report before and after the training period. Individuals’ scores on aggression, with- 
drawal, and likeability dimensions were standardized within the class, using same- 
sex peer nominations, at each testing period. ANCOVAs for each of the three di- 
mensions revealed no significant group difference. Table 7 presents group means 
and SDs. 

Observations of the three training groups (LOGO, BASIC, and problem solving) 
were designed to capture social and affective dimensions of the tutorial experience, 
as well as overt planning and task involvement. The means and SDs of each of the 
observations are presented in Table 8. 

Because there was a trend toward group differences in number of observations, 
F(2,66) =2.61, p<.08, the total number of observations was used as a covariate in 
the analysis of group differences. 

Significant group differences emerged in the social interaction of the children 
during the training sessions. The number of intervals in which the child was ob- 
served interacting with a peer partner, F(2,65)=7.28, p<.001, teacher partner, 
F(2,65=10.84, p<.001, or with both peers and teachers, F(2,65)=25.90, p<.001, 
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TABLE 5 


Harter Perceived Competence Scale 


Group 
Scale LOGO Basic Problem Control 
Solving 
Challenge 
Pre 2.91(.66) 2.97(.69) 3.22(.49) 3.20(.59) 
Post 2.92(.62) 2.97(.63) 2.90(.54) 3.08(.59) 
Curiosity 
Pre 3.18(.47) 3.09(.55) 3.07(.43) S-2 14.08) 
Post 2.93(.32) 2.98(.58) 2.82(.43) 315C0e8) 
Master 
Pre 2.85(.59) 2.88(.55) 2.97(.67) 2.77(.68) 
Post 2.86(.54) 2.93(.59) 3.03(.48) 2.99(.62) 
Judgment 
Pre DESLV.O4) 2.32(.49) 2.31(.63) 2.18(.70) 
Post 2.61(.66) 2.42(.64) 2.60(.48) 2.34(.57) 
Criteria 
Pre 2.65(.74) We WA lati ay) 2.62(.66) Dawe LO) 
Post 2.95(.53) 2.94(.76) 2.89(.73) 3.03(.86) 


showed significant group differences in ANCOVA analysis. Means were adjusted 
for differences in the total number of observations. Comparisons of adjusted means 
for peer partners indicated highest peer involvement for the BASIC group 
(X=15.41), followed by the LOGO (X=13.39) and problem solving (X=11.30) 
training. Teachers involvement was highest in the LOGO (X=18.65) group, fol- 
lowed by the BASIC (X=15.64) and problem solving (X=12.97) group. The prob- 
lem solving training, however, showed the greatest number of simultaneous peer 
and teacher interactions (X=17.95), compared to the LOGO (X=11.42) and 
BASIC (X=10.27) sessions. 


The children in the BASIC group were found to give more information to 
others (X=15.08) than children in the other groups (Xs=10.71 for LOGO and 
11.83 for problem solving) at a trend level of significance, F(2,65) =2.83, p<.06. 

Groups differed in the directive aspects of interaction. The ANCOVAs for com- 
pliance, F(2,65)=15.66, p<.001) and directs others, F(2,65)=28.00, p<.001, were 
significant. Comparison of the adjusted means indicated more compliance in the 
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TABLE 6 


Children’s Attitudes Towards Computers 


Group 
Scale LOGO Basic Problem Control 
Solving 
Evaluation 
Pre 1.33(.60) 1.36(.50) 1.26(.38) 1.41(.43) 
Post 2.93(2.4) 2.66(2.1) Dae eo) 3.04(2.3) 
Quality 
Pre 2.03(.69) 2.10(.81) 2.05(.87) 1.94(.68) 
Post 3.30(1.6) 3.00(1.6) Sullolee) Be ASK (iL s7) 
Kase 
Pre 3.10(1.3) 3.23(1.4) 3.75(1.4) 3.70(1.5) 
Post Shon (Ulse)) 3.58(1.5) 4.06(1.1) 3.34(1.4) 
Expert 
Pre 2.40(1.0) 2.60(1.1) 2.16(.80) 2.31(.75) 
Post 3.09(1.7) 3.38(1.7) 3.15(1.6) Beek (les) 
TABLE 7 
Pupil Evaluation Inventory 
Group 
Scale LOGO Basic Problem Control 
Solving 
Agression 
Pre — .34(.86) — .36(.57) .10(.94) .24(1.18) 
Post — .18(.90) = ALEX AY) .03(1.13) .12(1.08) 
Withdrawal 
Pre .03(.89) .14(1.01) = 32 (02) .01(.84) 
Post .11(.79) .18(1.29) Serel(@O3) .16(.70) 
Likeability 
Pre .20(.82) .15(1.03) 206 (mle) Sali le Oni} 
Post .17(.87) .05(.92) — .03(1.00) — .05(.96) 
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TABLE 8 


Observational Measures 


Group 

Behavior LOGO Basic Problem 

Solving 
Off Task 1.25(1.4) 1.21(1.8) 1.88(1.9) 
Negative Affect Hig aeuy) .38(.65) .75(1.29) 
Positive Affect 5.92(5.9) 6.17(4.4) 8.96(6.4) 
Solitary Activity 8.13(2.6) 8.79(3.1) 8. 7113.1) 
Peer Partner 12.8(4.3) 16.0(3.7) 11.3(4.1) 
Teacher Partner 18.0(4.5) 16.3(4.9) 12.9(3.8) 
Overt Planning 4.00(2.3) PTA (ES) 1.83(1.6) 
Peer and Adult Partner 10.5(3.5) 11.2(3.0) P79 2) 
Receives Information 23.8(6.0) 23.9(6.2) 25 s1( 7.2) 
Asks For Information 2.67(2.3) 2.96(1.9) 2.96(5.1) 
Gives Information 10.7(3.3) 15. 10525) 11.8(6.3) 
Compliance Zollo) 3-21(1-4) 0.92(.88) 
Directives 2.30(1.3) 4.54(2.8) 0.50(.59) 
Resists 0.12(3.4) 0.21(.59) 0.21(.42) 
Equal Exchange 4.96(3.9) 4.58(3.5) 4.08(2.2) 
Dominant Exchange 36.6(4.9) 39.1(5.3) 38.1(5.7) 


LOGO group (X=3.09) and BASIC (X=2.40) than in the problem solving group 
(X=0.98). Children were also less directive in the problem solving (X=0.57) than 
in the BASIC (X=38.58) and LOGO (X =3.29) training groups. 


The number of time intervals in which direct planning was observed also signif- 
icantly differed among groups, F(2,65)=7.59, p<.001. Calculation of adjusted 
means revealed greater overt planning in the LOGO training group (X =4.58) than 
in the BASIC (X=2.13) and problem solving (X =1.88) group. 


No other significant group differences were found in the ANCOVAs for 
observational measures. 


Discussion 


In general, the results of the pre- and posttests indicated relatively few changes 
in problem solving skills which could be attributed to training. No group differ- 
ences were found for the ability to integrate two variables. Only one of the effi- 
ciency indices showed posttest differences, and that difference favored the problem 
solving and BASIC programming training. These two groups also showed greater 
gains in flexibility than did the LOGO and classroom control groups. This rather 
sparse pattern of results led us to conclude that there was no generalized benefit 
from 40 hours of computer training on classical measures of problem solving skills. 


There were also very few group differences in the attitudinal and mastery meas- 
ures, indicating that the computer experience did not hold any particular advan- 
tage in promoting feelings of internal control over learning among the children in 
the study. Our subjects were typical grade six children, and our findings certainly 
do not exclude the possibility that mastery may be enhanced by providing 
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computer experience to other children, such as those who have failed in traditional 
school curricula. 


Although the focus of this study was the question of general transfer, several 
process dimensions were measured through observation of the instructional ses- 
sions. Our observations focused on social, emotional, and task-oriented components 
of the children’s behavior. We will concentrate on the descriptive rather than ex- 
planatory function of this process measurement given that there were few changes 
which could be attributed to training. 


In general, motivation among all three training groups was relatively high. This 
was reflected in the low levels of off-task behavior and the preponderance of posi- 
tive over negative affect. The children in the problem solving group apparently had 
as much fun as the computer group, suggesting that computers in themselves do 
not have special power to elicit positive affect. In all the training groups there was 
also a high rate of information exchange (combining categories of giving, receiving, 
and asking for information). Children in the BASIC group showed more 
information-giving than the other two groups. The observers’ written comments in- 
dicated that much of this information was shared with peers. Further observational 
research should make a finer differentiation in the function of these exchanges 
(e.g., answering questions, volunteering advice, clarifying ambiguities). Of particu- 
lar interest would be the extent to which peer tutoring occurs in computer training 
groups. The potential for major benefits may be relatively strong in this context, 
possibility leading to increased confidence, sharper communication skills, and en- 
hanced understanding of the concepts being taught. 

One of the few positive findings in favor of the LOGO group was a higher fre- 
quency of overt planning. Children in this group showed greater explicit 
verbalization of advance planning than the other groups. These public acts of plan- 
ning may indicate more reflective consideration of task demands. They may also 
have served a social function by facilitating shared task-oriented activity. A curric- 
ulum which enhances this overt planning between children should also lead to more 
precise communication skills and cooperative problem solving. These specific pro- 
cesses were not tested in the current study. 

One of the initial fears about the widespread use of microcomputers was that 
children would become “vidiots,” isolated from normal peer interaction and defi- 
cient in important social experiences. Although the data are sparse (see Rose- 
Krasnor & Mitterer, 1985 for a review), the research seems to indicate that com- 
puters can actually facilitate social interaction when compared to more traditional 
school activities. The current study clearly revealed a high level of social interac- 
tion in both computer training groups. The nature of the interaction was not the 
same across groups, however. The children in the LOGO group had relatively more 
interactions with the tutor; the children’s interactions in the BASIC group were 
characterized by relatively more peer-only contact. In contrast, children in the 
problem solving group showed a relatively high degree of simultaneous contact with 
peers and tutors. These results were not due to individual differences between 
tutors, since tutors were counterbalanced across sessions for all groups. Interaction 
differences may have resulted from differences in the types of problems character- 
istic of each of the training programs. The desirability of the different patterns of 
interaction is both an empirical and philosophical question. Our data indicate, 
however, that there are reliable interaction differences among the groups. 
Knowledge of these qualitative attributes may help educators decide among train- 
ing programs once curriculum goals are decided. 
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Our failure to find significant sex differences was noteworthy. There has been a 
concern among some educators that girls would show less interest and motivation 
for computers than boys. Observations indicated that girls were as motivated and 
involved in the computer training groups as were their male classmates. Our single 
significant sex difference indicated that boys had a higher level of off-task behavior 
than girls, but it did not vary by group. Girls and boys showed equivalent affect, 
planning, information exchange, and social involvement. Similar observations in 
free choice settings in which other noncomputer activities are available, however, 
may show a different gender pattern. 


The overall pattern of results suggests that the LOGO experience provided in 
this study did not promote the transfer of LOGO learning to other problem solving 
situations. This finding, which can be viewed as a disappointing one, raises ques- 
tions about LOGO transfer and the conditions under which it might be obtained. 
In searching for those conditions, we confronted a number of important theoretical 
and methodological issues which will be addressed in the remainder of this paper. 


One possibility which must be considered, contrary to Papert’s (1980) claims, is 
that LOGO affords extremely limited opportunities for transfer. Such a view draws 
support from two major themes of the extensive literature on transfer: that much 
knowledge is domain-specific and that general problem solving heuristics are very 
“weak” (e.g., Simon, 1980). The skills which children learn from programming in 
LOGO may be too domain-specific and/or weak to foster measurable transfer. If 
this view is correct, then LOGO will be much less generally useful than previously 
thought. One way to address this possibility is to apply classical principles articu- 
lated in the extensive literature on transfer to LOGO learning. Only a few research- 
ers have so far explored this approach (e.g., Krasnor & Mitterer, 1984; Gallini, 
1985). Such an analysis might reveal just how much domain-specific knowledge 
versus general problem solving heuristics are involved in learning LOGO and thus 
could allow a better understanding of whether or not transfer should be expected. 


Another possibility is that LOGO learning will transfer to other domains but 
not under the conditions of the current study. If this is the case, just what are 
those other conditions? Our search for alternate conditions has made it clear to us 
that there exists a large gap between the LOGO philosophy espoused by Papert 
(1980) and others and “LOGO reality”: the actual events which transpire daily in 
classrooms. Another way of expressing this is that we do not yet have a “theory of 
LOGO?” which is capable of making predictions about the conditions under which 
transfer might or might not occur. In our view, theoretically motivated positions on 
the following unresolved key issues (and their interactions) constitute the mini- 
mum requirements for such a theory: 


(1) What Do Children Learn About LOGO in the First Place? 


Proponents of LOGO have so far failed to articulate what it means to learn to 
program in LOGO. An unstated assumption behind the widespread acceptance of 
Papert’s ideas is that children can and do learn to program in LOGO. The classic 
pre- posttest design which formed a major part of the current study was adopted 
with this as an implicit assumption. As such, this design told us little about what 
children actually learned, how changes occur when learning LOGO, or the specific 
processes which were responsible for change. A program evaluation using only a be- 
fore/after design is thus both premature and incomplete; a more process oriented 
approach is required to yield information about the potential mechanisms of 
change, as well as the qualitative aspects of children’s experiences during training. 
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Recently, this point has been exceptionally well made by workers at the Bank 
Street College of Education (e.g. Kurland, 1984; Pea & Kurland, 1984). Pea and 
Kurland decry the view in which “Programming serves as a ‘black box,’ an 
unanalysed activity whose effects are presumed to irradiate those who are exposed 
to it” (p. 9). Instead they argue that “questions about the development of program- 
ming skills require a breakdown of the skills into component abilities, and studies 
of how specific aspects of programming skills are acquired” (p. 9). While the cur- 
rent study did examine process measures, these were not by and large directly rele- 
vant to assessing what was learned. 


Thus it will be necessary to focus more attention on defining just what it means 
to learn to program in LOGO in the first place. We will then have a much better 
idea of what changes we might reasonably expect to test with pre- posttest designs. 
For example, Kurland and Pea (1983) recently examined children’s ability to learn 
about recursion—a programming construct which Papert (1980) implied that chil- 
dren can easily learn. However, these investigators convincingly demonstrated that 
although children can learn to write recursive programs, they do not understand 
them. On the basis of this result, it may fairly be concluded that testing for the 
transfer of a recursive style of problem solving is likely to reveal a null result. Such 
information is obviously useful in determining how to test for transfer. In the 
course of revealing just what is learned about LOGO, more of such critical analysis 
and empirical demonstration will make possible better predictions about transfer. 


(2) How Much Experience with LOGO is Necessary to Ensure Transfer? 


One commonly accepted principle is that transfer will not occur until knowl- 
edge is sufficiently well developed in the domain of original learning (Ellis, 1965). 
Thus how much experience with LOGO will be required before transfer can be ex- 
pected? In the current study, one obvious possibility is that the children were not 
given enough learning time to manifest transfer, even though 40 hours of instruc- 
tion is more than most other training studies have given. However, the LOGO liter- 
ature does not yet provide any basis for making reasoned estimates of how much 
experience is enough. Until such time as theory has been developed to the point of 
allowing principled predictions, we must rely on the results of experiments such as 
the present one to empirically address the question. 


(3) What Type of LOGO Curriculum is Required to Support Learning for 

Transfer? 

It will also be necessary to specify just what type of learning experience is best 
suited to promote LOGO learning and transfer. Thus it is also possible that our 
curriculum was inappropriate, even though we deliberately modeled it to reflect the 
“LOGO philosophy.” Perhaps we created too much structure to allow the free ex- 
ploration necessary to foster the emergence of deep insights into mathematics, 
computation, and thinking itself. Alternately, perhaps we offered too little struc- 
ture, thus not ensuring adequate exposure to and practice with critical concepts. 
Further, Krasnor and Mitterer (1984) noted that the approach to teaching LOGO 
presented by Papert (1980) failed to consider several well established transfer prin- 
ciples. 

An analysis of the existing transfer literature might suggest ways of modifying 
the LOGO curriculum to make students explicitly aware of the general utility of 
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powerful ideas. This awareness may enhance the likelihood that learning to pro- 
gram will produce benefits in general problem solving skills. Again the extant liter- 
ature on LOGO offers only the broad outlines of a LOGO curriculum which might 
foster transfer. We need much more theoretical and empirical elaboration of this 
question. 


(4) Can Every Child Learn LOGO for Transfer? 


Common sense tells us that not every child can learn LOGO well enough to 
show transfer. Some children, especially the very young, might not be able to learn 
it at all. Thus it might be argued that our choice of children was not optimal, al- 
though it was motivated by the argument that only formal operational children 
could benefit enough from learning LOGO to show transfer. Perhaps this argument 
is wrong. Perhaps some other age group (or distribution of males and females, or 
socioeconomic group, etc.) would have shown transfer. Again, the existing literature 
on LOGO is simply not adequate enough to support more precise predictions. 


(5) What is the Right Way to Measure LOGO Transfer? 


It might be argued that the failure of the current study to find general benefits 
of LOGO stem from the inadequacy of the measures of transfer, even though they 
are traditionally accepted measures of the efficiency, flexibility, and integration of 
general problem solving skills. It could be asserted, for example, that these meas- 
ures do not assess the type of learning that is uniquely occurring when children 
learn LOGO. A thorough analysis of what is learned from LOGO would be an ex- 
tremely useful guide for selecting more sensitive and “LOGO-appropriate” meas- 
ures of transfer. 


In retrospect, it is amazing how little we know about the impact of computers 
and LOGO on children’s learning. It will, in our view, take quite some time before 
we develop a sufficiently rich theoretical base to offer definitive answers to the key 
questions outlined above. The current study is important in this broad context be- 
cause it sets boundary conditions.on when transfer might be expected by revealing 
at least one set of conditions under which transfer was not obtained. Such studies, 
each exploring different combinations of experimental parameters and measures, 
will serve to inform future theoretical work and thus, ultimately, to clarify the use- 
fulness of LOGO for developing general thinking skills. 


While this study obviously does not constitute definitive evidence against the 
existence of some as yet unknown set of conditions under which transfer might 
occur, it does carry strong practical implications. The seven weeks of daily small- 
group training by specially trained tutors exceeds the intensity of computer in- 
struction currently available in most schools. Therefore, it is likely that educators 
realistically should not expect dramatic changes in thinking skills as a result of cur- 
rent levels of computer instruction. 


In conclusion, we did not find the revolution in thinking skills which might 
have been expected on the basis of glowing anecdotal reports and early theoretical 
claims. As such, we are sceptical that teaching LOGO (or BASIC), especially under 
current conditions, will be useful as a way of teaching general thinking skills. We 
do, however, acknowledge the possibility that once we know much more about what 
it means to learn to program and about how to build LOGO curricula to explicitly 
foster transfer, LOGO might still be an excellent means to this admirable instruc- 
tional end. Such a demonstration, however, is not likely to become available in the 
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immediate future. This pessimistic assessment must not, however, be construed as 
an argument against the use of LOGO in schools. We believe that the LOGO phi- 
losophy currently represents one of the most interesting ways of thinking about 
how to use computers in schools. There are a variety of reasons why LOGO should 
still be seen as having considerable promise in the field of education. It is, In our 
opinion, the best language currently available for introducing children to program- 
ming as a skill in its own right. It is an intrinsically interesting activity, and is also 
a useful tool for the explicit exploration of topics in mathematics (Abelson & 
DiSessa, 1981), linguistics (Rowe, 1976), and artificial intelligence (Bundy, 1980). 


Notes 


1. We would like to express our appreciation to the teachers and principals of the Lincoln County Roman Catholic Separate 
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would also like to especially thank Mary Lou Orr and Roberta Dyment who worked very hard to provide the children with 
excellent instruction, and to the parents and children who showed great goodwill and enthusiasm. This project was funded 
by Grant #410-83-0809 from the Social Sciences and Humanities Research Council of Canada. Reprints are available from 
Linda Rose-Krasnor, Department of Psychology, Brock University, St. Catharines, Ontario, L2S 3A1. 
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Factors Related to Cursive Writing Skill 


This study explored the relationship between quality of writing and certain instruc- 
tional strategies and materials. A representative sample (n=137) of grade three stu- 
dents was used to obtain both printing and writing samples during the same school 
year. A previously developed Diagnostic Inventory was used to evaluate each sample 
for neatness and the commonly recognized major elements of form. Several relation- 
ships were discovered between quality of printing and certain instructional practices. 
These relationships tended to disappear as the students moved from printing to writ- 
ing. The results were thought to have implications for the preservice and inservice 
training of primary grade teachers. 


Manuscript writing (i.e., printing) normally gives way to cursive writing in 
grade two or grade three (Freeman, 1958; Herrick & Okada, 1963; Peck, Askov, & 
Fairchild, 1980; Renaud & Graff, 1966; Suen, 1975). Niemann’s (1971) research 
suggested that children in the second or third grade make a smoother transition 
than do older children. Handwriting manuals usually recommend flexibility in the 
choice of transition time frequently based on an acceptable level of proficiency in 
manuscript writing (Suen, 1975). Teachers normally attempt to individualize the 
transition by initially teaching cursive writing to only the better printers, assuming 
that proficiency in printing is related to proficiency in handwriting. Some research 
(Otto & Smith, 1980; Wallace & Kauffmann, 1978) supports this latter notion by 
suggesting that cursive writing be withheld from students who experience difficulty 
with printing. 

Armitage and Ratzlaff (1985), using the Diagnostic Inventory as a measure of 
writing achievement, demonstrated a low or no correlation on six of seven aspects 
of writing skill when comparing manuscript writing samples and cursive writing 
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samples of grade three students. The only aspect that registered a high degree of 
relationship was neatness. This result caused the researchers to conclude that 
teachers need not individualize the transition from manuscript to cursive writing 
and to recommend that all class members be moved from one to the other simulta- 
neously. 


The present study sought to explore antecedent factors that might be related to 
quality of writing, either manuscript or cursive. What practices and/or materials 
are related to writing achievement as measured by the Diagnostic Inventory? 
Briggs (1970, 1980) demonstrated that students who write badly suffer lower marks 
when content in written composition is held constant. The study suggests that 
evaluators tend to form an image of student competency and personality based on 
quality of writing. There exists, then, a need to promote quality writing and a need 
to determine if certain teacher practices or materials are related to quality of writ- 
ing. 

Rubin and Henderson (1982) collected normative data on the instructional 
practices of teachers who taught nine- and ten-year-olds from a borough in Lon- 
don, England. Teachers appeared to have no consistent guidelines for teaching 
handwriting. In fact, one in ten teachers did not even provide specific handwriting 
instruction. Among those that did, Rubin and Henderson found much variation in 
terms of such factors as method of instruction, frequency of practice, length of les- 
son, and writing style taught. They go on to relate success in handwriting to the 
motivation and ability of the child, as well as the methods and ability of the teach- 
er. 


Method 


The sample (n=175) for this study represented seven intact grade three classes 
from six schools in three school districts. All classes, schools, and districts were 
chosen so as to obtain a representative sample of grade three students. Data analy- 
sis involved those subjects (n=187) from whom both manuscript and cursive writ- 
ing samples were received. 


These students produced the manuscript writing sample in October and the 
cursive writing sample the following April. The samples were forwarded to the re- 
searchers for scoring using the Diagnostic Inventory to score seven facets of writing 
evaluation: letter formation, letter size, uniformity of slant, spacing of letters, spac- 
ing of words, word alignment, and neatness. Five raters were used in scoring the 
samples. These scores, along with responses from a questionnaire completed by the 
teachers in April, were used in the data analysis. 


Results 


The reliability and validity of the Diagnostic Inventory were established and 
are reported elsewhere (Armitage & Ratzlaff, 1985). Inter-rater reliability was good 
and a subject’s score was taken as the arithmetic mean of the five raters. This aver- 


age score was accepted as a reasonable approximation of each subject’s true score 
(Allen & Yen, 1979). 


Responses from the teacher questionnaire were studied to determine which in- 
structional strategies and materials were commonly used. Table 1 reveals that ap- 
proximately 80% of the students received writing instruction that (a) did not 
involve alphabetic order of presentation, (b) did go from lower case to upper, and 
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(c) did not mix lower and upper case in a given lesson. No students were instructed 
to write using vertical strokes before moving to slant. 


Table 1 also suggests that teachers were relatively split in the use of special 
methods. About as many students as not received instruction using (a) hand and 
arm air movements, (b) verbalization of hand and arm movements, and (c) practice 
on a chalkboard. Most students (70%) did not trace letters and words on paper 
prior to writing freehand. 


TABLE 1 
Comparative Use of Various Instructional Strategies for the Teaching of Writing 
(n=137) 
Instructional Practice Yes No Missing 
1. Sequence of Instruction 
a. alphabetic order 25 94 18 
b. lower to upper 94 20 18 
c. mixed lower and upper 20 94 18 
d. vertical to slant 0 119 18 
2. Special Methods 
a. air movements (hand) 66 Fpik 0 
b. verbalizing movements 73 64 0 
c. practice on chalkboard 76 61 0 
d. tracing on paper 41 96 0 


The main analysis involved the relationship between printing or writing scores 
and various instructional practices and materials. Table 2 identifies those instruc- 
tional practices for which there were significant relationships and those for which 
there were none. Generally, there were no statistically significant (a=.05) relation- 
ships between instructional practices or materials and assigned scores for cursive 
writing samples. Specifically, only scores for letter size (W2), alignment (W6), and 
neatness (W7) were related to instructional practice for three of nine practices. 
Higher scores were registered by those who (a) used air movements, (b) traced on 
paper, and (c) used paper that was not interlined or was 3/8 interlined. A higher 
score was also registered in neatness (W7) for those students involved in 
verbalizing the hand movements. 

Printing scores, on the other hand, were significantly related on all seven meas- 
ures for five of nine practices. The other four of nine practices were related on one 
to four of the measures. Specifically, higher scores were registered by those stu- 
dents whose teachers (a) used alphabetic order in lesson presentations, (b) did not 
go from lower to upper case but, rather, mixed the order, (c) used hand and arm 
movements, but (d) did not verbalize these movements. Those students exposed to 
chalkboard practice tended to score higher on letter size (P2), uniformity of slant 
(P3), spacing of letters (P4), and alignment (P6). Students using noninterlined 
paper also scored higher on P2 and P6, as well as letter formation (P1). If students 
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TABLE 2 


Significant* Relationships (X) Between Select Instructional Practices and Facet 
Scores Assigned to Printing (P) and Writing (W) Samples (N= 137) 


Instructional Evaluation Facet 
Practice PIP P2 P39 E49 PS) P6VE Wil Ww 2 Wee 4a Vip ee Om Wai 


1. Instructional Sequence: 


a. alphabetic order EXON PN NP OS 
b. lower to upper EX REPENS SC EEX! 
c. mixed lower to upper EEN eX EX XCORIX! 


2. Special Methods: 


a. air movements Deke olen Pe Ores rake (Ke Pa’ XxX Me 
b. verbalizing movements XEN EX EX PEXO TEKS TEX, xX 
c. chalkboard practice XX AX x 

d. tracing on paper XxX xX xe xX x 

e. type of paper XO xX xX <eeeX: 
f. followed pub. series X 


ly oO 9) 


were exposed to tracing on paper, they scored significantly higher on letter forma- 
tion (P1), spacing of words (P5), and neatness (P7). Students of teachers who used 
a published series of writing lessons scored higher on one facet only, namely, spac- 
ing of words (P95). 


Discussion 


Certain instructional strategies and materials appear to be related to quality of 
writing produced by students, especially for printing, thus supporting the research 
results reported by Rubin and Henderson (1982). Instruction that introduced 
printing alphabetically and did not routinely move from lower to upper case, rather 
mixed the order, produced significantly higher scores on all seven evaluated facets. 
Higher quality printing appeared to be related to instruction involving hand and 
arm movements alone without a verbalization of the movements. The significant 
differences or relationships disappeared when cursive writing scores were consid- 
ered, except for the use of air movements where three of the seven evaluative facets 
registered significantly higher scores. 


Instruction involving practice on the chalkboard or tracing letters and words on 
paper appeared to be related to higher quality printing, but on mutually exclusive 
evaluative facets. Taken together, they were related to significantly higher scores 
on all seven facets. Type of paper used for instruction did not appear to be too sig- 
nificant and resulted in higher scores for both printing and writing on only three of 
seven evaluative facets. However, as with the use of hand and arm air movements, 
both tracing on paper and type of paper used appeared to relate to higher scores 
for writing, not printing alone. 


It would be worth noting that it would be rather naive to suggest that the above 
relationships were unaffected by the instruction received by these grade three stu- 
dents in grade one and two and were solely due to instruction received from their 
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grade three teacher. Data was not collected on instructional practices and materials 
used in their grade one and grade two experience. 


Conclusion 


Several factors investigated in this study appear to be related to quality of 
printing and writing, particularly the former. However, the relationship between 
several instructional strategies and materials and quality of printing appears to dis- 
appear when the same subjects switch to cursive writing. Sequence of instruction 
factors had no continuing effect, while several special methods of instruction 
seemed to have a minor relationship to both manuscript and cursive writing. 
Based, then, on this study, it would appear that there is no abiding relationship be- 
tween certain instructional techniques or materials and quality of handwriting. 


This research result, along with that of Rubin and Henderson (1982), has impli- 
cations for the preservice and inservice training of primary grade school teachers. 
Certain teaching practices, strategies, and materials appear to be related to learn- 
ing manuscript writing. This relationship would appear to be “washed out” when 
moving to cursive writing. Based on this research result and that reported earlier 
(Armitage & Ratzlaff, 1985), it would seem reasonable to suggest that primary 
teachers move their students simultaneously from printing to writing and not con- 
cern themselves too heavily with the issue of identifying the best practices, 
strategies, and materials. 
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Two Teachers’ Perspectives on Social 
Studies Teaching in French Immersion 


Researchers have paid little attention to the question of how French immersion 
teachers manage to integrate second language teaching with social studies teaching. 
A case study of the perspectives of a grade one and a grade two teacher on social 
studies teaching in a French immersion situation disclosed that the teachers traded 
off social studies concept and inquiry skill development goals for second language de- 
velopment goals such as developing students’ vocabulary and grammatical structures. 
The teachers’ decisions appeared to be influenced by their perceptions of parents’, 
administrators’, and consultants’ expectations that they develop students’ 
proficiency in French combined with their own beliefs and assumptions about second 
language teaching and learning and child socialization. 


Since the inception of French immersion programs in the 1960s, experimental 
studies focusing on the educational outcomes have consistently demonstrated that 
French immersion schooling has no negative effects on students’ academic and lin- 
guistic competencies, and on their intellectual and general development (Barik & 
Swain, 1976; Calgary Catholic Schools, 1979; Council of Ministers of Education, 
Canada, 1982; Cummins, 1976, 1983; Day & Shapson, 1985; Genesee, 1984; Gray & 
Cameron, 1975; Laberge, 1984; Lapkin & Swain, 1984; Shapson & Kaufman, 1976, 
1978; Stern, 1984; Swain & Lapkin, 1981, 1983). Some researchers attribute the 
success of immersion to a “communicative competence” methodology, which is 
supposed to provide students “with a great deal of comprehensible input” 
(Krashen, 1984, p. 41) so that they learn French incidentally while dealing with the 
regular school curriculum (Wesche, 1984). However, Bibeau (1984) points out that 
we still have no significant data on teaching methods or materials actually used by 
teachers in immersion (p. 46). With respect to social studies teaching, research has 
not shed light on the kind of social studies knowledge teachers select and organize 
for early elementary French immersion students. 

This article reports the major findings of a case study of two early elementary 
teachers’ perspectives on social studies teaching in French immersion (Mahé, 1984), 
and discusses the implications of the findings in the light of recent trends in 
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French immersion research. The argument advanced in this article is that unless 
teachers are encouraged to develop a commitment to the purpose and goals of so- 
cial studies within the immersion context, and unless a realistic social studies pro- 
gram is developed for early elementary French immersion which goes beyond the 
translation of English language documents, immersion students’ opportunities to 
develop social studies concepts and higher order thinking skills, such as inferential 
reasoning and problem solving abilities, may be limited. 


A current trend in research on teaching is based on the assumption that in 
order to understand teaching, teachers’ goals, judgments, and decisions must be 
understood in relation to their behavior in the classroom context (Shavelson & 
Stern, 1981, p. 455). Qualitative or naturalistic studies, including case studies, 
portrayals, participant observation, and ethnographic studies have yielded rich in- 
sights, ideas, and hypotheses suggesting that teacher decision making is influenced 
by a wide variety of external and internal variables (Delamont, 1976; Esland, 1971; 
Nelson & Cornbleth, 1982; Shavelson & Stern, 1981; Shaver, Davis, & Helburn, 
1979; Smith & Geoffrey, 1968; Wilson, 1977; Woods, 1985). 


Theoretical and Methodological Framework 


A symbolic interactionist perspective guided the researcher in her attempt to 
describe and explain how teachers’ perceptions of the French immersion situation 
influences their social studies knowledge selection and organization. Symbolic 
interactionists assume that the way individuals act toward things and others in sit- 
uations depends on how they define and interpret a variety of external and internal 
elements constituting their situation (Blumer, 1969; Hewitt, 1979; Mead, 1934). 


The case study method. The theoretical dictates of symbolic interactionism re- 
quire that an investigation has to take place in a natural setting, and that the re- 
searcher has to come to grips with the participants’ definitions and interpretations 
of their situation (Blumer, 1969; Hewitt, 1979). The case study method was select- 
ed for probing into the constructs underlying a grade one and a grade two teachers’ 
social studies decisions and actions, as it allows a researcher to carry out an inten- 
sive in-depth naturalistic investigation of persons in a specific situation (Black & 
Champion, 1976, pp. 90-91; Helmstadter, 1970, p. 51), and to get close to the par- 
ticipants in order to record their experiences from their point of view (Denzin, 
1970, p. 452). 


Naturalistic investigators suggest gathering data from different vantage points 
by using two or more complementary methods, and then triangulating or pitting 
against each other the data gathered from different methods in order to cross- 
check and verify the credibility and dependability of the data findings (Denzin, 
1970, pp. 459-460; Guba, 1978; Pohland, 1972, p. 11). A limited classroom partici- 
pant observation methodology, triangulated with unstructured and structured in- 
terviewing and document analysis was selected for gathering data and for 
generating and testing hypotheses, ideas, and insights representative of the two 
volunteer teachers’ perspectives. 


Data Gathering Techniques and Procedures 


The case study took place in the greater Edmonton, Alberta region. During the 
1980-1981 school term, intermittent field work was carried out for four months at 
the grade one level, and for six months at the grade two level. The grade one 
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teacher terminated the research project because of a pregnancy leave. The grade 
two teacher discontinued teaching social studies in April in order to prepare her 
students for the school board standardized tests. 


It was possible for the researcher to observe, at the grade one level, five social 
studies lessons for a total of 420 minutes, and to observe, at the grade two level, 17 
social studies lessons for a total of 843 minutes. During the classroom observations, 
the writer attempted to minimize her influence on the on going classroom activities 
by sitting in a corner of the room and recording in shorthand the verbal and 
nonverbal classroom interactions, and noting how the teachers and students 
interacted with different types of resource materials. 


Interviewing, according to Pohland (1972), complements observing and provides 
an evidential base which closely approximates the respondent’s reality (p. 12). The 
researcher interviewed the grade one teacher in the privacy of her classroom, gener- 
ally at the end of the school day, on 12 occasions for a total of 370 minutes, and in- 
terviewed the grade two teacher in an unoccupied resource room during the lunch 
hour on 10 occasions for a total of 287 minutes. With the teachers’ permission, the 
interviews were recorded on a cassette and shorthand field notes were kept on the 
key interview topics, the key responses, and of the documents examined. Following 
the interviews, the tapes and field notes were transcribed by the writer in 
typewritten form. The interview transcripts were given to the teachers for 
verification as to their accuracy and for generating additional data. The grade one 
teacher verified 84 pages of interview transcripts, and the grade two teacher 
verified 109 pages of interview transcripts. 

At the beginning of the project, the interview questions were generally 
unstructured in order to allow the teachers to tell their own stories in their own 
words (Blumer, 1969; Hewitt, 1979, p. 260; Li, 1981, pp. 45-46). Clarification ques- 
tions, based on the teachers’ previous statements, or questions based on the 
investigator’s classroom observations were asked when it appeared that the 
teachers had exhausted a topic (Li, 1981, pp. 45-51). As the project evolved, the 
writer structured her questions in order to test or verify her data interpretations 
and hypotheses with the teachers. 


Data Analysis Techniques and Procedures 

The perspective of symbolic interactionism assisted the researcher in identify- 
ing, from the data generated from different vantage points, categories of internal 
and external elements which appeared to influence the two teachers’ social studies 
decisions and actions. The data from the interview and classroom observation 
transcripts, and the document examination field notes were analyzed, categorized, 
and synthesized by the writer on an on going basis. Le Compte and Goetz (1982) 
and Guba (1978) recommend doing member checks of the researcher’s data inter- 
pretations to test whether or not the conceptual categories are also shared by the 
participants. Once during the project, and at the end of the project, the teachers 
verified the typewritten transcripts of data interpretations. 

Patterns of similarities and differences between the grade one and the grade 
two teachers’ situations were tested on an on going basis. Although many similari- 
ties were found between case studies, how each teacher defined and interpreted her 
situation was unique to her context, experiences, perceptions, beliefs, assumptions, 
knowledge, and emotional state of being. 
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Findings 
The two teachers were Franco-Albertans who had received their early education 
in bilingual schools offering one hour of instruction in French per day. The 
teachers, who had taught approximately seven years in bilingual and French im- 
mersion programs, had not received preservice training for French immersion or for 
teaching social studies. 


The two elementary schools in which the teachers taught fell under the jurisdic- 
tion of the same school board. A French immersion program attended by 
nonfrancophone students and a regular English program were offered in both 
schools. According to the grade one and grade two teachers, their students came 
from middle and upper middle class homes. In both schools administrators and 
librarians were English-speaking. The grade one teacher taught 32 students, and 
the grade two teacher taught 22 students. 


Teachers’ Definition of the Situation 


The data consistently suggested that the two teachers traded off the social 
studies concept and inquiry skill development goals prescribed in the 1978 Alberta 
Social Studies Curriculum (Alberta Education) for second language development 
goals. Some of the concepts to be developed at the grade one and two levels in- 
clude: environment, human needs, identity, institution, and interaction. The major 
inquiry skills to be developed are: gathering information, analyzing, evaluating and 
synthesizing data, resolving an issue, formulating a plan of action, and analyzing 
aspects of human values (pp. 9-24). 


The teachers’ lack of commitment to the goals of the social studies curriculum 
appeared to be influenced by their definition of the situation. From a symbolic 
interactionist perspective, the definition of the situation is a process in which indi- 
viduals, prior to acting, explore, examine, and feel out through behavior and 
thought the behavior possibilities of a situation. They will note and evaluate exter- 
nal expectations and demands, including the norms, values, and perceptions of par- 
ticular groups, and the limitations which the situation imposes upon their 
prospective line of action (Blumer, 1969, pp. 53-64; Hewitt, 1979, pp. 134-141). As 
individuals attempt to fit the external elements to their own internal elements such 
as their intentions, goals, and feelings (Blumer, pp. 53-60), they will form an atti- 
tude toward the situation, thereby defining the situation in one way and not in an- 
other (Blumer, p. 20; Waller, 1970, pp. 162-163). 


Perceptions of Parents and Administrators’ Expectations 


The two teachers claimed during interviews that parents and school administra- 
tors expect that immersion students will acquire French language skills while 
achieving as well in all areas of schooling as those students in the English program. 
The teachers considered such expectations as unrealistic. 

The grade one teacher explained that all subject matter programs for French 
immersion are translated from English into French. She said that before she can 
begin to teach a subject, she has to teach her students the necessary vocabulary. 
She believes that it is not realistic to expect students with different language learn- 
ing abilities to have to learn more at a much faster rate than students in the Eng- 
lish program. 

The grade two teacher stated that parents and administrators expect children 
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will speak French with ease after spending a year in the French immersion pro- 
gram, and that they will achieve as well, if not better, in all school subjects as those 
students in the English program. She viewed such expectations as being too much 
to ask of students who have different learning abilities. In April, she discontinued 
teaching social studies in order to prepare her students for the school board stand- 
ardized tests in French and English language arts, mathematics, and science. Social 
studies, she said, was not tested because it was not considered to be a basic subject 
in French immersion. She stated that the tests were a way for the school board to 
verify whether or not immersion students were achieving according to English and 
French program norms. She thought that the French tests originated from Ottawa. 
She described school system personnel as lacking an understanding of possible 
achievable goals in French immersion, and she was worried that if her students did 
not succeed on the tests, her reputation could be at stake. 


Perceptions of Social Studies and Second Language Learning 


Either during interviews or after classroom observations, the two teachers 
showed the researcher an English copy of the 1978 Alberta Social Studies Curricu- 
lum and copies of different school board designed social studies teaching units in 
French. According to the teachers, it is not possible in an immersion situation to 
implement the concept and inquiry skill development goals outlined in the provin- 
cial social studies document. The teachers’ lack of commitment to the goals 
outlined in the provincial social studies document appeared to be influenced by 
several factors such as: a lack of training and of human and material resources; 
their perceptions of consultants’ expectations; their perceptions of second language 
acquisition; and their knowledge, beliefs, and assumptions about teaching, learning, 
and child socialization. 

The lack of resources. The teachers, who had not received any formal training 
for social studies teaching, stated that without the assistance of competent French- 
speaking social studies consultants and suitable social studies materials in French, 
they could not see themselves implementing the provincial social studies concept 
and inquiry skill development goals. 

Consultants’ expectations. The teachers explained that the French consultants 
expect them to teach the vocabulary outlined in the school board designed social 
studies teaching units, but that it is up to the teachers to select their own method- 
ologies. Even though the teachers described the school board units as being made 
up of a hodgepodge of unrelated themes (grade one unit topics included the house, 
the seasons, Thanksgiving, Halloween, Christmas, clothing), and of “little recipes” 
designed to amuse children, the thematic and vocabulary basis for their classroom 
social studies curriculum originated mainly from these units. Upon examining the 
teaching units, the writer noted that these units did not provide the teachers with 
examples of concept and inquiry skill development activities. 

Second language acquisition. The grade one teacher explained that it takes stu- 
dents approximately six to eight months before they can interiorize enough French 
for their responses to become spontaneous. She described the second language 
learning process as follows: Firstly, students need to learn to listen so that they can 
hear the sounds in French. Then they need to repeat the words in French. Before 
sentences become automatic for them, they go through a “franglais” stage and will 
say, “Oh! I hurt my jambe!” and “I don’t have a gomme to effacer!” The grade one 
teacher stated that because of her students’ limited knowledge of French, it is not 
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possible for her to involve them in value clarification activities. She said that she 
does not have the time to ask 32 students questions such as: “How do you feel 
about such and such?” or “What do you prefer?” She therefore believes that social 
studies in French immersion can be no more than simple vocabulary development. 


The grade two teacher explained that during a discussion, students will either 
say nothing or use “franglais” or English. They will say a few words in French and 
expect her to fill in their sentence for them. She was concerned that, when she fills 
in a sentence for a student, she may not be correctly interpreting what the student 
has in mind. As far as social studies teaching is concerned, she finds that by the 
time she teaches her students the vocabulary in the social studies teaching units, 
she has little time left for concept development and inquiry oriented activities. 


Teaching and learning and child socialization. The investigator noted that the 
two teachers, when planning and teaching social studies, consistently included 
traditional teacher-controlled vocabulary development and application activities 
and receptive skill development activities such as: asking students descriptive ques- 
tions requiring a single correct answer; showing students pictures and asking them 
to repeat words and sentences; getting students to draw pictures representing new 
words learned; having students copy words in their scribblers; involving the class in 
singing songs containing the vocabulary taught; and reading to students stories 
containing newly introduced vocabulary. 


The two teachers during interviews described themselves as including citizen- 
ship development goals in their social studies teaching. They interpreted citizen- 
ship development as child socialization and life skills development. The writer 
noted that, during classroom observations, both teachers generally encouraged their 
students to share, to be patient and tolerant towards others, and to obey classroom 
rules. 


Discussion and Implications 


Influenced by a complex interplay of external and internal elements, the 
teachers, when planning and teaching social studies, traded off concept and inquiry 
skill development goals for second language development goals, and they consist- 
ently involved their students in teacher-controlled vocabulary and receptive skill 
development activities. One could argue that if social studies in French immersion 
is taught using traditional second language teaching methodologies, then immer- 
sion students’ opportunities to develop important citizenship goals and to practice 
inquiry skills such as information seeking, analyzing and evaluating information 
and values in conflict, drawing conclusions, making inferences, developing concepts 
and generalizations, and solving personal and social problems, may be limited. 


The Need for a Commitment to the Goals of Social Studies 


The two teachers’ decisions to trade off social studies concept and inquiry skill 
development goals for second language development goals, especially vocabulary 
development, and the development of receptive skills, may be a normative practice 
in other immersion contexts. 

There is evidence which seems to suggest that the importance of language de- 
velopment is being overemphasized in French immersion (Shapson, 1984). 
Wightman (1979), for example, has observed that in early grades, the immersion 
teacher’s initial step when introducing a new topic for study is to develop students’ 
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vocabulary (p. 59). There is further evidence which suggests that receptive skill de- 
velopment, rather than productive skill development, is a primary objective in 
French immersion. For example, Lapkin (1984) has noted that for years the pri- 
mary objective in the early stages of any French immersion program was to ensure 
the development of students’ receptive skills (p. 575). Harley (1984), who con- 
ducted an in-depth study of elementary immersion students’ grammar, discourse, 
and sociolinguistic competencies has found that immersion students have a re- 
markable ability to comprehend spoken and written discourse. She attributes these 
abilities to the methodology actually used in French immersion, such as students 
listening to the teacher talk and reading subject matter texts. 


Recently, Long and Porter (1985) have criticized the “comprehensible input” 
approach to second language acquisition, and have questioned whether simply 
hearing and understanding the new items are sufficient for a learner to use the lan- 
guage. They call for the need to give students opportunities to produce new forms 
of language in group work (p. 214). Likewise, Allen (1985) sees the need to use lan- 
guage for real purposes in a student-centered class which gives students “a chance 
to communicate their thoughts and feelings in a second language” (p. 996). Allen 
also recommends asking students “evaluative” questions rather than factual ques- 
tions, as evaluative questions give them a chance to communicate their thoughts 
and feelings, without having to worry about errors in grammar or pronounciation 
and can result in a higher level of communication (pp. 995-996). According to Long 
and Porter (1985), even if students make errors, these should be viewed as a 
“healthy” part of language development (p. 223). Harley (1984) has found that im- 
mersion students are capable of responding in French in a manner that is usually 
coherent in the context, and that they will make use of nonverbal communication 
strategies such as gesture or mime to get their meaning across. She has noted that 
without having mastered conditional verb forms (verb forms necessary for express- 
ing a hypothetical situation in social studies), grade five students will use a variety 
of verb forms such as the present tense together with an adverb, or the future con- 
struction with an infinitive, to get the essential meaning across. 

Calling upon second language teachers to shift from a teacher-centered, recep- 
tive skill development approach, to a student-centered, productive skill develop- 
ment approach may be problematic. There is considerable evidence which suggests 
that teacher talk and receptive skills are also being emphasized in English class- 
rooms (Ponder & Davis, Jr., 1982; Shaver, Davis, & Helburn, 1979; Sirotnik, 1983; 
Stodolsky, 1984; VanSickle, 1985). Resistance to the new social studies is also a 
case in point. Expository teaching, primarily via textbook and teacher talk, con- 
tinues to dominate practice in most of today’s social studies classrooms (Engle, 
1986, p. 20). Engle points out that in most schools, the intellectual and social cli- 
mate and governance are “unfavorable to open, questioning education over which 
the learner exercises some measure of control” (p. 22). Sirotnik (1983) explains that 
the greatest difficulty facing teachers today who may want to adopt a more 
student-centered or inquiry approach to instruction, is that they must overturn an 
enduring stereotypical image of what is a “good” classroom, or what Enright and 
McCloskey (1985) refer to as the “No talking!” image. Teachers, Engle (1986) 
argues, are the products of a “mindless education,” and they are “poorly prepared 
to help students learn the skills of social criticism that are so important to democ- 
racy” (pp. 21-22). 

Preservice and inservice teacher education. If teachers and future teachers of 
French immersion are to develop a commitment to the purpose and goals of social 
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studies, there is a need during their preservice and inservice training to: (a) allow 
them to inquire into the problems and questions of various social science disci- 
plines; (b) help them to understand that the goals of immersion and of social 
studies programs go far beyond the mastery of vocabulary and grammatical struc- 
tures, and that language learning is not an end in itself, but rather a tool for learn- 
ing something else; (c) make them aware that social studies concept and inquiry 
development approaches have many similarities with the communicative compe- 
tence approach, in that both approaches are learner-centered and focus on helping 
students develop meaning, see the relationships involved in the materials studied, 
and develop their inferencing and problem solving abilities; and (d) give them op- 
portunities to practice in a laboratory setting (Berliner, 1985) communicative com- 
petence and inquiry skill development techniques such as simulations, group work, 
role playing, and questioning strategies for problem solving and inferencing. 


The need for a realistic social studies program. The finding from the case 
study, that French immersion students are expected by parents and administrators 
to achieve as well as students in the English program, may be a normative expecta- 
tion in other immersion milieus. The literature on programs of studies for French 
immersion across Canada (Conseil des Ministres de l|’Education, Canada, 1983; 
Lapkin, 1984) confirms that immersion students are expected to study the same 
content in all subject areas as their peers in the regular English program, and that 
subject matter programs are generally translated from English into French. 


Integrating communicative competence goals and social studies goals. There 
appears to be a need to develop a realistic social studies program for early elemen- 
tary French immersion which goes beyond the translation of English documents 
into French. Such a program could incorporate examples of social studies concept 
and inquiry development activities integrated with communicative competence 
techniques suggested by second language researchers, notably Enright and 
McCloskey (1985), Long and Porter (1985), Maiguashca (1984), and Yalden (1984). 
A more controlled approach to concept development would have to be considered 
in such a program. The vocabulary for dealing with the attributes of the concepts 
would probably have to be specified. As well, examples of grammatical structures 
could be included which would allow students to respond to analytical and 
evaluative types of question. 


Research and debate. Unless the question of trade-offs in French immersion is 
further explored and openly debated, what appears to be a successful innovative 
approach to second language learning could limit students’ opportunities to de- 
velop critical thought. There is, therefore, a need for research which could provide 
insights into how immersion teachers’ beliefs, assumptions, and their second lan- 
guage teaching strategies can either enhance or limit students’ opportunities to de- 
velop social studies concepts and to acquire the knowledge, skills, and values 
needed for democratic citizenship. 


References 


Alberta Education. (1978). 1978 Alberta social studies curriculum (Interim ed.). Edmonton: 
Alberta Education. 


Allen, E.D. (1985). Communicative competence and levels of proficiency. The Canadian 
Modern Language Review, 41(6), 991-999. 


208 


Social Studies Teaching in French Immersion 


Barik, H.C., & Swain, M. (1976). Primary-grade French immersion in a unilingual English- 
Canadian setting: The Toronto study through grade two. Canadian Journal of Education, 
1(1), 39-58. 

Berliner, D.C. (1985, November-December). Laboratory settings and the study of teacher edu- 
cation. Journal of Teacher Education, 2-8. 

Bibeau, G. (1984). No easy road to bilingualism. Language and Society, 12, 44-47. 

Black, J., & Champion, D.J. (1976). Methods and issues in social research. New York: John 
Wiley. 

Blumer, H. (1969). Symbolic interactionism. Perspective and method. Englewood Cliffs, NJ: 
Prentice-Hall. 

Calgary Catholic Schools. (1979). Report on Calgary Catholic Schools French immersion pro- 
gram, E.C.S. and grade one, 1978-1979. A research review and comparative evaluation. 
Calgary, Alberta. 

Conseil des Ministres de l’Education (Canada). (1983). Actualités provinciales. Dialogue, 1(4). 

Council of Ministers of Education (Canada). (1982). Manitoba. The French immersion research 
project. Dialogue, 1(2). 

Cummins, J. (1976). The influence of bilingualism on cognitive growth. A synthesis of research 
findings and explanatory hypotheses. In M. Swain & G. Dumas (Eds.), Working papers on 
bilingualism, 9, (pp. 1-43). Toronto: The Ontario Institute for Studies in Education. 

Cummins, J. (1983). Research findings from French immersion programs across Canada: A 
parent’s guide. CPF Special Report. Ottawa: Canadian Parents for French. 

Day, E.M., & Shapson, S. (1985). Assessing oral communicative skills in immersion programs. 
CAIT News, 8(3), 15-17. Ottawa: Canadian Association of Immersion Teachers. 

Delamont, S. (1976). Interaction in the classroom. London: Methuen. 

Denzin, N.K. (1970). The methodologies of symbolic interaction: A critical review of research 
techniques. In G.P. Stone & H.A. Farberman (Eds.), Social psychology through symbolic 
interaction (pp. 447-465). MA:Ginn. 

Engle, S. (1986). Late night thoughts about the new social studies. Social Education, 50(1), 
20-22. 

Enright, D.S., & McCloskey, M.L. (1985). Yes, talking! Organizing the classroom to promote 
second language acquisition. TESOL Quarterly, 19(3), 431-453. 

Esland, G.M. (1971). Teaching and learning as the organization of knowledge. In M.F.D. Young 
(Ed.), Knowledge and control. New directions for the sociology of education (pp. 70-116). 
London: Collier-Macmillan. 

Genesee, F. (1984). French immersion programs. In 8. Shapson & V. D’Oyley (Eds.), Bilingual 
and multicultural education: Canadian perspective (pp. 33-34). Avon, England: 
Multilingual Matters. 

Gray, V.A., & Cameron, C.A. (1975). Evaluation of early French immersion in Fredericton, NB 
(Interim Report). Fredericton: Psychology Department, University of New Brunswick 
(mimeo). 

Guba, E.G. (1978). Criteria for assessing the trustworthiness of naturalistic inquiries 
(ERIC/ECTJ Annual Review Paper, 29, 75-91). Syracuse, NY: Syracuse University. 

Harley, B. (1984). How good is their French? Language and Society, 12, 55-60. 

Helmstadter, G.C. (1970). Research concepts in human behavior. New York: Appleton-Cen- 
tury Crofts. 

Hewitt, J.P. (1979). Self and society: A symbolic interactionist social psychology (2nd ed.). 
Toronto: Allyn and Bacon. | 

Krashen, S.D. (1984). Immersion: Why it works and what it has taught us. Language and Soci- 
ety, 12, 61-64. | 

Laberge, E. (1984). The French immersion programme in St. Norbert. Contact, 3(4), 9-11. 


209 


Yvette T.M. Mahé 


Lapkin, S. (1984). How well do immersion students speak and write French? The Canadian 
Modern Language Review, 40(4), 575-585. 


Lapkin, S., & Swain, M. (1984). Research.update. Language and Society, 12, 48-53. 

LeCompte, M.D., & Goetz, J.P. (1982). Problems of reliability and validity in ethnographic re- 
search. Review of Educational Research, 52, 31-60. 

Li, P. (1981). Social research methods. Toronto: Butterworth. 

Long, M.H., & Porter, P.A. (1985). Group work, interlanguage talk, and second language acqui- 
sition. TESOL Quarterly, 19(2), 207-228. 

Mahé, Y.T.M. (1984). Social studies teaching and French immersion: A case study of two ele- 
mentary teachers’ perspectives. Unpublished doctoral dissertation, University of Alberta. 

Maiguashca, R.U. (1984). Semantic fields: Towards a methodology for teaching vocabulary in 
the second-language classroom. The Canadian Modern Language Review, 40(2), 274-297. 

Mead, G.H. (1934). Mind, self and society. (C.W. Morris, Ed.). Chicago: University of Chicago 
Press. 

Nelson, M., & Cornbleth, C. (1982). Social studies research and classroom teachers: Open the 
door wider. Social Education, 46, 218-220. 

Pohland, P. (1972). Participant observation as a research methodology. Studies in Art Educa- 
tion 13. 


Ponder, G., & Davis, O.L. Jr. (1982). Social studies education. In H.E. Mitzel (Ed.), Encyclope- 
dia of educational research (5th ed.) (4, pp. 1723-1732). New York: American Educational 
Research Association. 

Shapson, S.M. (1984). Personal comments on immersion in Canada. Contact, 3(1), 2-3. 

Shapson, S., & Kaufman, D. (1976). French immersion: A western perspective. Canadian Soci- 
ety for the Study of Education, Year Book, 3, 8-25. 

Shapson, S., & Kaufman, D. (1978). Overview of elementary French programs in British Co- 
lumbia: Issues and research. The Canadian Modern Language Review, 34(3), 586-603. 
Shavelson, R.J., & Stern, P. (1981). Research on teachers’ pedagogical thoughts, judgments, de- 

cisions, and behavior. Review of Educational Research, 51, 455-498. 


Shaver, J.P., Davis, O.L., & Helburn, S.M. (1979). An interpretive report on the status of 
precollege social studies education based on three NSF funded studies. Social Education, 
43, 3-18. 

Sirotnik, K. (1983). What you see is what you get: Consistency, persistency, and mediocrity in 
classrooms. Harvard Education Review, 53(1). 


Smith, L.M., & Geoffrey, W. (1968). The complexities of an urban classroom: An analysis 
toward a general theory of education. New York: Holt, Rinehart & Winston. 


Stern, H.H. (1984). The immersion phenomenon. Language and Society, 12, 4-8. 

Stodolsky, S.S. (1984). Teacher evaluation. The limits of looking. Educational Researcher, 
13(9), 11-18. 

Swain, M., & Lapkin, S. (1981). Bilingual education in Ontario: A decade of research. Toronto: 
Ontario Institute for Studies in Education. 


Swain, M., & Lapkin, S. (1983). Evaluating bilingual education: A Canadian case study. Avon, 
England: Multilingual Matters. 


VanSickle, R.L. (1985). Research implications of a theoretical analysis of John Dewey’s How 
We Think. Theory and Research in Social Education, 13(3), 1-20. 


Waller, W. (1970). The definition of the situation. In G.P. Stone & H.A. Farberman (Eds.), So- 
cial psychology through symbolic interaction (pp. 162-174). Waltham, MA: Ginn. 


Wesche, M.B. (1984). A promising experiment at Ottawa University. Language and Society, 
LZ yp 20-253 


210 


Social Studies Teaching in French Immersion 


Wightman, P. (1979). French immersion: An educational alternative. In B. Mlacak & 
E. Isabelle (Eds.), So you want your child to learn French! A handbook for parents 
(pp. 54-65). Ottawa: Canadian Parents for French. 


Wilson, S. (1977). The use of ethnographic techniques in educational research. Review of Edu- 
cation Research, 47, 245-265, 


Woods, P. (1985). Sociology, Ethnography and teacher practice. Teaching and Teacher Educa- 
tion, 1(1), 51-62. 


Yalden, J. (1984). The design process in communicative language teaching. The Canadian 
Modern Language Review, 40(3), 398-413. 


211 


The Alberta Journal of Educational Research Vol. XXXII, No. 3, September, 1986, 212-222. 


WALTER B. BOLDT 


and 


BILLIE HOUSEGO 


University of British Columbia 


Critical Incidents in the Supervision of 
Student Teaching: 

A Cluster Analysis of the Perceptions of 
Graduate Transfer Students 


To gain further understanding of student teachers’ perceptions of their experiences 
in having their teaching supervised, student teachers were asked to rate a sample of 
critical incidents in the supervision of student teaching on three, seven-point, bipolar 
scales anchored by the terms realistic-unrealistic, tmportant-unimportant, and 
manageable-unmanageable. They were asked three questions concerning these 
incidents: (a) To what extent do these incidents reflect the reality of the supervision 
experience? (How common are these incidents in the supervision of student teaching?) 
(b) To what extent are these incidents critical or important in the supervision of 
student teaching? (Would discussing these incidents with your supervisors affect your 
teaching performance?) (c) To what degree are the problems described in the 
incidents ones on which student teachers can take some action; how manageable are 
they? (Could you respond in some meaningful way to such issues?) The rating profiles 
of the incidents over the three scales were compared through hierarchical cluster an- 
alysis. Three clusters, personal adjustment incidents, lesson preparation incidents, 
and housekeeping rules of conduct incidents along with indications of their perceived 
realism, importance, and controllability emerged. The results confirm the view that 
“survival type” incidents were central in the student teachers’ supervision experi- 
ences and that supervisors of student teaching are, for the most part, ideological in 
their supervisory approach. 


The all-encompassing goal of supervising student teaching is to ensure that the 
student teacher becomes proficient in teaching. This assumes that a supervisor, 
whether a classroom teacher or a university professor, can identify effective 
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teaching and guide the student teacher toward that goal, that he or she can analyze 
the discrepancy between “what is” and “what ought to be” and meet the student 
teacher’s need for “a sustained, expert program to help him/her relinquish . . . ex- 
isting classroom behavior in favor of new behavior” (Cogan, 1973, p. 4). More fun- 
damentally, it also assumes that supervisors are able to perceive teaching in the 
manner that student teachers perceive it, and that they can accommodate their 
supervisory behavior to these perceptions in order to bring about desired teaching 
behaviors. Basic questions about what supervisors actually do in an effort to be 
helpful must be asked if the supervisory process is to be improved; moreover, it is 
important to know how student teachers perceive the help provided them. Which 
actions on the part of supervisors are perceived to contribute to improved teaching 
and which are viewed as less helpful? Supervisors of student teaching vary consid- 
erably in the approaches they use in assisting and providing feedback to their 
supervisees. A number of such approaches are summarized in a model of student 
teaching supervision described by Housego and Grimmett (1983). 


The categorization of supervisory behavior proposed by Housego and Grimmett, 
and shown in Figure 1, is rooted in the supervisor’s need to take account of two 
major concerns, to reduce the threat often felt by student teachers and to make the 
best possible use of the limited time available in brief periods of school experience. 
As shown on the left-hand side of Figure 1, if threat concerns are uppermost, a 
supervisor attempts to be facilitative or understanding and assuring. If the 
supervisor utilizes a research-based knowledge of teaching, supervision can go be- 
yond being purely indulgent, a so-called “tea party” (Goldhammer, 1969, p. 61), to 
become to a greater degree knowledge-based or deliberative. The supervisor 
functioning in the deliberative mode “is able to discuss with the student teacher 
alternative strategies for achieving desired outcomes with pupils—outcomes contin- 
gent upon learner characteristics and the nature of the classroom environment” 
(Housego and Boldt, 1984, p. 170). The deliberative supervisor is knowledgeable 
with respect to effective teaching, whereas the indulgent supervisor is simply a 
kind, concerned person. 

When shortage of time concerns are uppermost, the supervisor may assume a 
directive role, as shown on the right-hand side of Figure 1, and, depending on the 
degree to which supervision is knowledge-based, either impose strong views about 
teaching on the student teacher in an ideological mode of supervision or, with a 
greater knowledge of teaching, proceed in a “technological” mode in which “teach- 
ing is regarded as being composed of a definable repertoire of knowledge, skills, 
and attitudes that a teacher brings to bear in an effort to create certain changes in 
learners” (Zumwalt, 1982, pp. 223-224). The technological supervisor, like the delib- 
erative supervisor, is knowledgeable with respect to effective teaching, whereas the 
ideological supervisor may know from experience “what works” and insist that the 
student teacher adopt those approaches. 

Assuming either a predominantly facilitative supervisory role or a predomi- 
nantly performance-based one is likely to create an undesirable polarity. If neither 
position rests in research-based knowledge of teaching, the resulting entrenched 
either ideological or indulgent supervision modes readily degenerate to become im- 
poverished supervision as shown at the bottom of Figure 1, What is needed is 
greater flexibility on the part of supervisors, the adoption of a so-called 
metadeliberative position (see Figure 1) in which “throughout the process of practi- 
cum supervision . . . supervisors constantly deliberate . . . on which of two 
supervisory modes most appropriately meets the apparent and most pressing needs 
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of the specific practicum context in which they must supervise at a given time” 
(Housego and Grimmett, 1983, p. 333). 


The dual concerns of time and threat are thus taken into account—technologi- 
cal supervisors going beyond merely presenting research proven skills and having 
student teachers practice them, to also concentrate as required, on students’ needs 
to feel secure and confident in the school environment, and deliberative supervisors 
extending their “research-embracing” facilitative approach to engage student 
teachers in practicing and receiving feedback on specific instructional behaviors. 
Emphasis may shift and at times be more towards pupil learning and instructional 
performance and yet, at other times, move more toward developing the confidence 
and professional autonomy of the student teacher. 


Theoretically, a metadeliberative approach to supervising student teaching has 
great appeal; however, supervisors’ abilities to function in this mode depend on 
their being able to understand student teachers’ perceptions of their classroom and 
supervisory experiences—the students’ reality. 


In an effort to explore this reality, a study of critical incidents occurring in the 
supervision of student teaching was undertaken (Housego & Boldt, 1984). One hun- 
dred, twenty-five critical incidents in the supervision of student teaching were col- 
lected from elementary and secondary student teachers with varying backgrounds 
and experience in education, from supervision teachers, and from faculty 
supervisors. Because it was expected that incidents in student teaching supervision 
would stem from the total spectrum of problems encountered in teaching, the 
incidents were categorized according to Broudy’s (1965) categories of problems of 
practice which characterize the education profession: aims and _ objectives, 
organization-management, curriculum, and teaching-learning. It was found that 
nearly 50% of the supervision incidents grew out of concerns with organizing and 
managing instruction. In another 30% of the supervisory incidents identified as 
critical, interpersonal relationships between student teachers and their supervisors 
were predominant. This necessitated the formation of an additional, so-designated 
category beyond Broudy’s classifications. Few supervisory incidents, about 16%, 
were rooted in substantive problems in teaching and learning. Critical supervisory 
incidents having to do with educational goals or curriculum-based problems were 
conspicious by their absence. These findings generated additional questions, partic- 
ularly with respect to the relative importance of the incidents, the degree to which 
they are truly representative of the experiences of student teachers generally, and 
the degree to which students perceive that they can manage or take action to rem- 
edy the problems described in some of the incidents. Accordingly, critical incidents 
contributed by student teachers were isolated and a decision made to ask another 
somewhat homogeneous group of student teachers to respond to such questions in 
a second phase of the study, an account of which is presented in this article. 


Method 
Subjects 


It was expected that subjects would respond to critical incidents according to 
their experience and background in education. To begin mapping these responses, 
45 student teachers from the one-year post-baccalaureate program of the Faculty of 
Education at the University of British Columbia were selected to provide the data 
to be studied. They constituted about 20% of the student teachers in this program. 
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Figure 1: A typology of approaches and the metadeliberative perspective on supervision. 
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The educational experiences of these subjects included a degree from another 
faculty, and, at the point where the data were collected, five months of professional 
studies ranging from philosophy of education to methods of teaching various school 
subjects, plus seven weeks of school experience. In this one-year program, the 
student teachers are relatively homogeneous in terms of background and experi- 
ence in education and, upon successful completion of the program, are eligible for 
certification as teachers. These were considerations in selecting the subjects in 
order to ensure reasonable uniformity and comprehensiveness in the background 
and experiential knowledge on which their responses to the critical incidents would 
be based. The 45 subjects were also drawn in such a way as to ensure reasonable 
representation of elementary and secondary preservice teachers, the latter group 
including a cross section of subject matter specialists. 


Items 


Since not all 125 incidents collected in the first phase of the study could be 
used in the second phase, and in order to avoid any systematic bias in selecting a 
feasible number of items, a random sample of 30 incidents was drawn from the 
pool of student-generated incidents. The sample size of 30 was determined on the 
basis of practical considerations such as time and the fatigue of the respondents. 
Trials indicated that about 30 items would be manageable. 


Instrument 


From the standpoint of using the results of this study for the improvement of 
student teaching supervision, particularly in the one year program, it seemed im- 
portant to determine the degree to which subjects would see the incidents: first, as 
realistic, second, as important in terms of their own experience in being supervised, 
and third, to the extent that incidents reflected problems, as manageable under the 
constraints of the student teaching experience. The instrument devised to obtain 
this information consisted of three 7-point, bipolar adjectival scales anchored by 
the terms realistic-unrealistic, important-unimportant, and manageable- 
unmanageable applied to each of the 30 items as shown in the following sample 
item. 


When pupils argued and failed to help set up P.E. equipment, my fac- 
ulty advisor gave me explicit directions as to how to handle setting up 
equipment efficiently. 


REALIST IG: ae error etree eaters terran etree eee : UNREALISTIC 
Va On Omer ier a. oem 

IMPORIDAIN Das, sige atevs cite nctauastrok ait cekreret mamet cet : UNIMPORTANT 
(Oe eee Eh eos ll 

MANAGEABLE Favccice-cte ect meneame teeny at : UNMANAGEABLE 


hE OR COge toe teks 1 


Analysis 

An analysis of the ratings for each of the 30 incidents on each of the three 
scales was carried out in order to determine whether or not the scales would serve 
to differentiate among the incidents and, in addition, if the three scales each would 
differentiate among the incidents in a unique way. The method of analysis used 
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was a repeated measures, analysis of variance design, with one within-subjects fac- 
tor (scales) and one between-subjects factor (incidents). The number of observa- 
tions in each of the 90 cells was 45. The three scales, realism, importance, and 
control constituted the three levels of the within-subjects factor and served as 
bases of judgment for differentiating between the incidents. The scales also allowed 
the subjects to indicate the degree to which they perceived each basis of judgment 
(scale) to apply to the incidents. By this method, then, the way student teachers 
perceived critical incidents in the supervision of student teaching from the 
standpoint of how realistic the incidents were, how important the incidents were in 
their experience, and how manageable they considered the incidents to be from 
their perspective, were obtained. 


Results 
Repeated-Measures Analysis 


Table 1 gives the results of the analysis of the ratings of the incidents on the 
three scales. It shows a significant difference among scales, a significant difference 
among incidents, and significant interaction effects between scales and incidents. 
These results were interpreted as indicating that, in terms of the bases of 
judgments represented by the scales, the student teachers did perceive the 
incidents differentially and, hence, provided at least some insight into what topics 
or areas in the supervision process they experienced that they perceived to be real- 
istic, important, and manageable. For supervisors, then, it would seem advisable to 
accommodate these perceptions in planning a metadeliberative approach. 


TABLE 1 


Analysis of Variance Based on Repeated Measures 


Source of Sum of Df Mean F 
Variation Squares Square 

A 833.00 29 28.76 Hfeallag fe 
S-Within 5289.31 Tang 4.01 

B 959.04 2 479.52 OMe 
AB 283.62 58 4.89 2>.76* 
BS-Within 4672.56 2638 rir 

po —a0n Oi 


Hierarchical Cluster Analysis 

The hierarchical clustering analysis compares the rating profiles of each 
incident over the three scales, obtains a measure of the similarity of the profiles of 
each incident over the three scales, obtains a measure of the similarity of the 
profiles, and progressively groups the incidents in such a way that the rating 
profiles are as similar as possible and the variation of the ratings in each group 
formed is as small as possible (Ward, 1963). At each step in the grouping process, 
the within-group variation (error) is minimally increased. A substantial increase in 
the error at a particular step suggests the beginning of a new cluster or group. 
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Table 2 reproduces the essential groupings displayed in the tree-diagram, the 
identification number of the incidents, and the mean scores of the incidents on 
each of the three scales. é 


Interpretation of the clusters of incidents in Table 2 was based on a close exam- 
ination of the interrelationships of the substantive content of the incidents as well 
as the mean ratings on the scales. Beginning with the innermost cluster in each of 
the three major categories (A, B, and C) and working outwards, the following inter- 
pretation of the three groupings of clusters of critical incidents in the supervision 
of student teaching was obtained: 


Category A (Supervision Incidents Emanating from Personal Adjustment 
Problems) 


Many of the problems of practice central to the incidents in Category A are 
what Brophy (1983, p. 282) would refer to as “personal adjustment problems.” In 
particular, disruptive self-criticism and coping with unusually disruptive behavior 
in individual pupils are the focal problems. Other clusters in this category included 
personal adjustment to somewhat less disruptive behaviors of the class as a whole 
(noise, inattention) and of oneself (poor planning habits). All the incidents in this 
category were rated as realistic, important, and manageable. These results suggest 
that coping personally with unusually disruptive behaviors on the part of oneself 
and, especially, the class is still very much a part of the experience of the subjects. 
Moreover, as Tardif (1985) observes, managing to control one’s class is an integral 
part of one’s “teacher identity.” As Fuller and Brown (1975, p. 37) might view 
them, the supervision incidents in Category A appear to relate largely to teaching 
problems which could be categorized as “survival concerns,” representative of the 
first stage in learning to teach. The supervisory approach, in most instances, was to 
supply some fairly precise suggestions or directives as to what to do. Seldom do the 
student teachers’ critical incident accounts of the responses of their supervisors re- 
veal that alternative strategies or research-based knowledge were discussed. The 
supervisory approach could be seen as largely ideological, although a more 
facilitative approach might have been expected. 


Category B (Supervision Incidents Emanating from Lesson Presentation 
Problems) 


This category shows a trend away from basic survival concerns toward concerns 
about teaching a “good” lesson. Specifically, the supervision incidents relate to 
problems of presenting an unappealing lesson, teaching difficult concepts, keeping 
the attention of the class, dealing with talkative students during the lesson presen- 
tation, and concluding the lesson successfully. The subjects rated these supervision 
incidents as realistic, very important, and quite manageable. Overall, the incidents 
in this category received the highest ratings. While these incidents were still closely 
tied to survival concerns, they do suggest a transition toward a somewhat more ad- 
vanced focus in the supervision of student teaching, a stage in which the problems 
have to do with group management during active instruction and with lesson pre- 
sentation itself. One would expect that such problems would most often be ap- 
proached in a somewhat more direct and technological mode of supervision 
(Housego & Grimmett, 1983, p. 324). Although the supervisor’s response in dealing 
with these problems was generally direct, there is no evidence in the student 
teachers’ critical incident statements of their supervisor’s responses that learnings 
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based on research were drawn to their attention. The approach was again 
ideological, primarily offering “craft-rules” for dealing with problems. 


Category C (Supervision Incidents Emanating from Housekeeping and Rules 
of Conduct Problems) 


The supervision incidents in Category C can be characterized according to 
Brophy (1983, p. 282) as relating to the “development of a workable set of house- 
keeping procedures and conduct rules.” The characteristic incidents deal with pre- 
paring and distributing materials and equipment, handling incidents, checking 
homework, using the blackboard efficiently, getting the attention of the class, eval- 
uating achievement, and planning a suitable lesson. Based on the ratings, these 
supervision incidents are seen as only marginally realistic, somewhat important, 
and manageable. Supervisory incidents in this category, too, are likely to be ap- 
proached in a technological mode, lending themselves as they do to specific pre- 
scriptions for action. Supervisors appear to have been quite directive, relying 
heavily on craft-rules. There is little evidence in what the student teachers report 
of supervisor reaction, that supervisors use research-based knowledge of teaching to 
engage student teachers in discussions of alternative problem solving strategies. 


In summary, the results suggest, that despite the category of incidents under 
consideration, according to the student teachers’ critical incident reports, 
supervisors tended to be largely ideological in their approach, using craft-rules 
more than research-based knowledge of teaching, and, therefore, being unlikely to 
provide student teachers with the intellectual maneuverability which would in- 
crease their ability to consider and select from a variety of problem solutions. 


Conclusions 


Critical incidents in the supervision of student teaching can be identified by 
any member of the student teaching triad: the student teacher, the university 
supervisor, or the sponsor teacher. The major thrust of this study was to examine 
more closely a sample of critical incidents in the supervision of student teaching 
generated by student teachers. In addition to the usual method of categorizing pro- 
cedures typical of critical incident analysis (Housego & Boldt, 1984), the writers in 
this study sought to enhance the analysis through the use of an empirical tech- 
nique, cluster analysis, to identify focal points for more intensive study. The cluster 
analysis did group the incidents in an interesting and interpretable way. 
Specifically, the analysis produced a differentiation among these incidents which 
agreed with that postulated by Fuller and Bown (1975) at the first stage of learning 
to teach, the stage of “survival concerns,” and with the centrality of the problems of 
classroom organization and management described by Brophy (1983). The cluster 
analysis of the incidents also provided a useful adjunct to the usual mode of re- 
search in critical incident studies, in providing explicit empirical information on 
how the clusters were developed and how the incidents in each cluster were interre- 
lated. 


On the substantive side of the study, as opposed to the methodological, the re- 
sults draw attention to needed training in both the craft aspects of professional 
activity and the application of pedagogical knowledge. Brophy (p. 266) notes that, 
in practice, the two key teaching tasks, classroom management (craft aspects of 
teaching) and instruction (pedagogy) are interdependent. This interdependence is 
clearly evident in the tree-diagram of the clusters making up the three major 
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categories shown in Table 2. In addition, Brophy contends that, 


Clear and detailed information is now available about how successful teachers organize 
and manage their classrooms . . . if applied systematically, these principles will enable 
teachers to establish their classrooms as effective learning environments and to cope with 
most conduct problems. (p. 266) 
The cluster analysis shows that student teachers attach substantial importance to 
such incidents and that they are viewed as very characteristic of their school and 
related supervision experiences. This is of relevance to supervisors of student 
teaching, suggesting the applicability of an approach to supervision such as that 
detailed in Figure 1, in which both the technology (craft-like aspects of teaching 
based on research) and concerns with the development of the student teacher’s 
confidence and autonomy come together according to the immediate needs of the 
student in a particular situation. 


Student teachers’ perceptions of supervision experience require further study. 
They must be taken into account by effective supervisors and included in meaning- 
ful supervisory training programs. Considering the evidence provided by this study, 
one becomes aware of the desirability of supervisors, whether classroom teachers or 
university faculty, drawing on the presently available research-based knowledge of 
teaching to enable themselves to assume a metadeliberative approach and thereby 
enhance the integration of the school experience and course work components of 
teacher preparation. In our view, the supervisory process includes dealing with 
many of the factors to which Campbell (1985) refers in her study of the variables 
affecting the instruction behavior of physical education student teachers during 
their practicum. 
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This study sought to determine what percentage of first time English-speaking 
kindergarten entrants could be identified as early readers (ERS), and to identify dif- 
ferences between ERS and nonearly readers (NERS) with respect to their home back- 
ground, home learning experiences, and book orientation skills. In September 1984, 
all first-time kindergarten entrants in the Saskatoon Public School System were 
screened using the Durkin Word Test (1966) and the Durrell Analysis of Reading 
Difficulty (1980). Twenty students identified as ERS were randomly matched with 
NERS from their classrooms. All students were then administered the Book 
Orientation Scale. Additional information was obtained through a parent question- 
naire and interviews. Analysis of the data led to the following conclusions. ERS repre- 
sented 1.64% of the population of kindergarten entrants. Demographic differences 
between ERS and NERS were similar to those found in previous research studies. The 
home environments of ERS appeared to have been conducive to the development of 
skill in reading and to the development of strategies for learning in a wide range of 


areas. 


The process by which children acquire literacy skills at an early age has been, 
and continues to be, the focus of much discussion and research. In 1908, Huey de- 
scribed the “natural reader” or the preschool child who learns to read at home with- 
out having participated in formal reading instruction. In more recent years, a 
number of comparative studies have attempted to establish why some children 
enter kindergarten already reading (Carlson, 1975; Clark, 1976; Durkin, 1966; 
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Evans & Smith, 1976; Getman, 1979; Harty, 1975; King & Friesen, 1972; Manning 
& Manning, 1984). 


Logically, a knowledge of the number of kindergarten entrants who can be iden- 
tified as early readers is important for long- and short-term planning on the part of 
school jurisdictions. The literature raises some questions as what percentage of 
kindergarten entrants can be expected to be early readers. In 1966 Durkin found 
less than one percent of the population could be identified as early readers, while 
in 1972 King and Friesen identified just over one percent of the population as early 
readers. Harty (1975) found six percent of the population to be reading at the time 
of school entrance. Three years after Harty’s study, Durkin (1978) stated that more 
and more preschoolers were learning to read at an earlier age than was the case 15 
to 20 years ago. Nevertheless, it remains unknown whether Durkin’s prediction of a 
larger population of early readers is accurate. 


Method 
Purpose of the Study 


Answers were sought to the following questions: Prior to any formal school in- 
struction in reading, what percentage of children first entering kindergarten in the 
Saskatoon Public School System, where English is the language of instruction, can 
be identified as early reader (ERS); what are the differences, if any, in the family 
backgrounds of early readers and nonearly readers (NERS); and can we differenti- 
ate early readers from nonearly readers in their book orientation skills? 


Instruments 


In September of 1984, three screening devices were used to identify ERS and 
NERS. The first device, a Teacher Questionnaire, requested kindergarten teachers 
to identify first-time kindergarten entrants who were reading 20 or more words. A 
second screening device, the Durkin Word List was developed by Durkin (1966) for 
her study of ERS and consists of 37 words and six sentences commonly found in 
basal readers. Children who could correctly pronounce 18 or more words within ap- 
proximately 20 seconds were considered to be potential early readers while those © 
unable to meet these criteria were considered to be NERS. The final screening de- 
vice was the Durrell Analysis of Reading Difficulty, Oral and Silent Paragraphs 
(1980). Potential early readers were asked to read both the oral and silent para- 
graphs and answer the comprehension questions. Those children who answered 
three or more questions correctly for each paragraph were considered to be ERS. 


Two measurement instruments, the Book Orientation Scale and Parent 
Questionnaire were used to investigate possible differences in book orientation 
skills and family backgrounds between ERS and NERS. The Book Orientation 
Scale was used in conjunction with the children’s book Your Turn Doctor (Perez & 
Robinson, 1982). This scale, based on the Sand Scale (Clay, 1972b), and the re- 
search of Goodman and Altwerger (1981) was developed by the investigators. It was 
designed to measure a child’s knowledge about the conventions, terminology, and 
methodology used in describing and reading a book. The Parent Questionnaire was 
also compiled by the researchers and consisted of 45 questions concerning the 
demographic characteristics of the family, reading habits of the family, home learn- 
ing experiences including television viewing, travel, preschool attendance, informal 
instruction in printing, reading and mathematics, and recreational and educational 
interests of the subject. Each item on the Parent Questionnaire gave a number of 


224 


Early Reading and Non Early Reading Kindergarten Entrants 


possible answers and provided space for additional choices or comments. The 
Parent Questionnaire was piloted in August 1984, prior to commencement of this 
study. 


Procedure 


In September 1984, all the 44 kindergarten teachers in English instruction 
classrooms within the Saskatoon Public School Division (N=1405 students) were 
asked to complete the Teacher Questionnaire. During October 1984, the identified 
children (13 males and 27 females) were administered the Durkin Word List and 
the Oral and Silent Paragraphs of the Durrell Analysis of Reading Difficulty 
(1980) within their school by the researcher. Twenty-three ERS were identified. A 
random selection of 20 ERS was made and 20 nonearly readers (NERS) were 
matched by classroom. The ERS (eight males and 12 females) and the NERS (12 
males and eight females) were administered the Book Orientation Scale by the re- 
searcher within the subject’s school. Testing time did not exceed 35 minutes. The 
caregivers of the randomly selected ERS and NERS were then mailed the Parent 
Questionnaire. Parent Questionnaires were completed and returned by the 
caregivers of 16 ERS and 15 NERS. Interviews were conducted with four ERS’ 
caregivers and four NERS’ caregivers who were selected randomly. All data were 
gathered by December 1984. Data were analyzed using either a t-test, chi-square, or 
statement of frequency of responses. Extensive profiles were developed for the ERS 
and NERS interviewed. 


Results 
Percentage of Population Identified as ERS 
Early readers represented 1.64% of the population of students who were en- 
rolled in kindergarten for the first time and who were attending English instruction 
classrooms. This supports findings by Clark (1976) and King and Friesen (1972) 
and is somewhat lower than that found by Durkin (1966), Harty (1975), and 
Hochstetler (1975). 


Demographic Differences Between ERS and NERS 

Statistically significant differences (p<0.05) were found between ERS and 
NERS on the following demographic features: (a) significantly more ERS had an- 
other adult living in the home in addition to their male and female caregivers than 
did NERS (t=2.251); (b) the achieved educational level of the ERS’ male 
caregivers were significantly higher than those of the NERS’ male caregivers 
(t =2.498); (c) the ERS’ male caregivers were employed at significantly higher oceu- 
pational levels, as classified by Statistics Canada (1982), than were NERS 
caregivers (t=2.218). ERS’ caregivers were more likely to be employed as profes- 
sionals or semiprofessionals (62.4% ) than were NERS’ male caregivers (33% ). 


Differences in Home Learning Experiences Between ERS and NERS 7 
Early readers were found to differ (o<0.05) from NERS in their home learning 
experiences in the following ways: (a) ERS’ male and female paTeet ote read maga- 
zines with significantly greater frequency than did the NERS caregivers (t =2.801); 
(b) the frequency with which the ERS’ female caregiver read the newspaper was 
significantly greater than that of the NERS’ female caregiver (t=2.802); (c) ERS 
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began to copy or write letters at a significantly earlier age than did NERS X in 
months: ERS=32.6; NERS=42.6; t=2.454); (d) ERS watched significantly more 
television per day than did NERS (X in minutes per day: ERS=116; NERS=70; 
t=2.086); (e) significant differences were found in the reported benefits of televi- 
sion viewing with more ERS’ caregivers stating that television was a medium by 
which their children learned about the world (t=2.186) and increased their 
vocabularies (t=2.374) than did NERS’ caregivers; (f) significant differences were 
found between what the ERS’ and NERS’ caregivers felt had influenced their 
children’s interest in reading. More ERS’ caregivers felt watching television 
(t=2.515), their children’s curiosity about the words (t=3.546), and telling stories 
(t=2.158) had influenced their child’s interest in reading than did NERS’ 
caregivers; (g) the caregivers of ERS gave their children significantly more help in 
three areas relating to learning to read than did the NERS’ caregivers. They 
discussed letter sounds (t=2.086), helped with spellings (t =2.4998), and identified 
words (t=2.498) more frequently than did the NERS’ caregivers; (h) when 
caregivers were asked to give three descriptors of their children, significantly more 
ERS’ caregivers termed their children as “shy, introverted, or self-centered” (50% 
to 18%) and as “curious” (50% to 13%) than did NERS’ caregivers. In contrast, 
significantly more NERS’ caregivers termed their children as “loving, caring, kind, 
or perceptive” (67% to 25%) than did ERS’ caregivers. 


Book Orientation Skills 


ERS were found to be significantly (p<0.05) superior to NERS in their book 
orientation skills. Specifically, ERS were significantly better at locating the begin- 
ning of the story within the book (t=2.690), showing left-right reading patterning 
(t=3.794) identifying the beginning (t=6.294), and ending of the story (t=6.294), 
identifying the authors of the story (t=2.847), identifying a word (t=4.127), identi- 
fying a capital letter (t=2.333), recalling detail ({=3.509), comprehension of the 
story (t=3.509), and knowledge of the term “last” (t=2.626). The data are 
consistent with previous research which found NER kindergarten entrants to be 
unfamiliar with the physical act of reading and the terminology commonly used in 
describing books (Barney, 1976; Clay, 1972a; Downing, 1970; Reid, 1966). 


Interviews with Caregivers 


Interviews with caregivers of four ERS and four NERS supported the data and 
also indicated the following differences between ERS and NERS: (a) ERS’ 
caregivers, especially their mothers, appeared to have labeled objects within the en- 
vironment, written notes, numbers, and words more often for their children than 
did NERS’ caregivers; (b) although the caregivers of ERS did not intentionally set 
out to teach their children to read, they held and expressed a positive attitude 
towards their child’s potential success with learning to read more frequently than 
did the NERS’ caregivers. This positive attitude appears to have encouraged the 
development of a stimulating and learning oriented environment within the ERS’ 
home. As well, the caregivers’ positive attitude towards their child’s future success 
with reading and learning in general may have caused the caregivers of ERS to set 
higher expectations for their children than were set by the caregivers of NERS; 
(c) all ERS appeared to have a great facility for learning and remembering. Learn- 
ing was viewed as a challenge, but also as a “fun-filled” experience; (d) ERS ap- 
peared to be self-motivated and showed great initiative in their desire to learn not 
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only about reading, but also about mathematics, writing, and the world around 
them; ERS’ caregivers considered them to be curious children; (e) ERS appeared to 
use television mainly as an information source, and secondarily as a source of rein- 
forcement of skills already learned and as a source of entertainment. This is in con- 
trast to NERS who appeared to use television mainly as a medium of 
entertainment and secondarily as a means of learning about the world; (f) the ERS 
appeared to be less interested in active, physical activities than were NERS. Any 
interest shown by ERS in physical activities was considered to have been 
stimulated through peers. 


Discussion 


Durkin (1978) asserted that more and more preschoolers were learning to read 
at an earlier age than was the case 15 to 20 years before. However, in this study 
readers represented 1.64% of the total population of kindergarten classrooms 
where English was the language of instruction. This supports previous findings 
(Briggs & Elkind, 1977; Clark, 1976; King & Friesen, 1972) and is somewhat lower 
than found by Harty (1975) and Hochstetler (1975). One might hypothesize that in 
Saskatoon the percentage of ERS in non-English instruction classrooms may be 
slightly higher. This hypothesis is based on the fact that most French and Ukrain- 
ian Immersion classes in this city are located in upper socioeconomic areas where 
caregivers are educated at a higher academic level and are presently employed in 
higher status occupations. In addition, since teacher judgment was used as the ini- 
tial screening device, some early readers may have been missed. Nevertheless, in 
this particular sample, there does not appear to be a large increase in the number 
of ERS as was predicated by Durkin (1978). 

Demographic differences found in previous research are, for the most part, 
supported by this study, while previous literature regarding the home learning ex- 
periences of ERS is both supported and contradicted (Table 1). One must conclude 
that additional factors to be discussed below, but not investigated in this and pre- 
vious research, play a role in determining whether or not a child will become an 
ER. 

Briggs and Elkind (1977) claimed that interest on the part of parents rather 
than the child may have been the main motivational determinant. The present re- 
search indicted that the caregivers of ERS held and expressed a more positive atti- 
tude towards their child’s future success in reading than did the NERS 
counterparts. Although this appears to be a factor, one must still question whether 
parental attitude and expectations, though a necessary factor, represent the major 
determinant in the development of reading skills in these children, or whether posi- 
tive parental attitudes, higher expectations, and a stimulating environment act as 
supportive secondary factors to “the nature of the child as an organism” (Thomas, 
Chess, Birch, Hertzig, & Korn, 1963). 

The question remains, what differentiated the “nature” of ERS from the “na- 
ture” of NERS? On the basis of the limited data available in this and previous 
studies, one can only speculate about possible answers. 

Schema theorists suggest that comprehension and learning are a function of the 
relationship between information and the currently available skills and knowledge 
of the learner. Bransford (1979) expands on the nature of this relationship stating 
the “most important factor underlying our ability to learn is the availability of cri- 
teria for evaluating the adequacy of our present level of understanding and 
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mastery” (pp. 194-195). Initially, ERS appeared to practice and rehearse concept 
definitions. This was exemplified in their frequent requests that letters, words, and 
numbers be written and rewritten and in their exhibition of “reading-like” behav- 
iors as described by Doake (1982) and Huey (1908). According to their caregivers, 
all of the ERS and most of the NERS exhibited “reading-like” behaviors or “mock 
reading.” However, as shown in the significant differences between ERS and NERS 
on many of the items on the Book Orientation Scale, ERS have a superior knowl- 
edge of the act of reading and the terminology associated with the reading of print. 
Bransford (1979) suggested, “the rehearsal of concept definitions, facilitates re- 
membering but does not increase understanding” (p. 233). 


The literature about metacognition emphasizes the importance of analyzing the 
types of experiences that help people learn to utilize appropriate knowledge when 
needed and those methods which fail to develop such skills. The home environ- 
ments of ERS shed some additional light in this area. First the ERS’ environments 
appear to encourage active and spontaneous practice in dealing with new situa- 
tions. For example the mother of Andrew, an ER, stated 

I almost always take five or ten minutes to talk about or explain what was on the [televi- 
sion] show when it’s finished. If there’s something Andrew or one of the other chidren 
want to know about we look it up in the encyclopedia or get a book from the library the 
next time we’re there. 

A second example was provided by the father of another ER, Alex. While 
browsing through a magazine, Alex, aged four, read an equation in a “skill testing 
question” found in an advertisement. Subsequently he asked his father how to 
complete the computations. Alex’s father responded by not only showing Alex the 
mathematical procedures to be completed, but he also provided a means, in this 
case a calculator, by which Alex could successfully complete the mathematical com- 
putations and solve the equation. At the time of this study, Alex, aged 61 months, 
was writing and developing his own equations using both an x and y variable, and 
he was successfully solving these equations using the calculator. Thus it appears 
that if an ER did not understand a concept presented on television or in print, he 
or she was encouraged, shown, modeled, and provided with a practical means for 
solving gaps in knowledge. 


Secondly, ERS live in an environment where feedback is very much the norm. 
At a metacognitive level, ERS unlike NERS appear to be able to monitor accu- 
rately their current levels of understanding and mastery. They also appear to be 
able to take action through questioning their parents and seeking information from 
television, or from print, to raise their current level of understanding. Their envi- 
ronment appears to encourage action. They are not fearful of being thought foolish 
for seeking answers to gaps in their knowledge, whether it is in the recognition of a 
letter, a sound, or a word. Thus the environment of ERS allows for risk-taking with 
the generating and testing of hypotheses actively encouraged, if not demanded. 
Levine (1972) suggests individuals who learn best do not appear to passively absorb 
information; rather they attempt to analyze and process information to form con- 
cepts. This is certainly true of ERS who, as was indicated by the interviews, took 
an active role in their learning, often selecting topics for which they requested 
additional information. All the ERS asked that books be read and reread to them. 
Many requested that letters be written and identified. The words of one ER’s 
mother illustrate the strong desire the ERS had to learn. 
Previously, especially between one and four years, he was asking so many questions and 
he asked me to read all his books . . . he’d ask me to write all the numbers to 100 at least 
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four times . . . he’d ask me to read every day first thing in the morning. . . . Initially, he 
wouldn’t even allow me to take a bath; he wanted me to read all his books at once. ... 
Even when we went to a friend’s house he’d ask me to bring some paper and something to 
write with so I could write them [the numbers from 0 to 100 and letters]. 

One might also hypothesize that ERS are more able than NERS to anticipate 
situations in which their knowledge must be utilized. The profiles of ERS indicated 
that an ER might use television listings to determine what programs he or she 
could view, or read the cover of a record album to select the music to which he or 
she might listen. The television, signs, magazines, and record albums read by ERS 
became more than just a viewing or listening activity. Rather, ERS used these ma- 
terials as sources of information and tools for learning more about letters and 
words. But more importantly, they anticipated potential uses of print, made self- 
evaluations of their strengths and weaknesses, and determined either that they 
were adequately prepared or that they required additional information and prac- 
tice. ERS then set realistic goals for themselves and proceeded to manipulate their 
environments so that they could reach these goals. For example, if an ER was unfa- 
miliar with several letters, he or she might demand that caregivers identify and 
write these letters. Their caregivers would then indirectly place them in a number 
of “test situations” by playing rhyming games, alphabet bingo, or cloze games. 
Similar strategies were employed with concepts relating to mathematics, words, 
and the world around them. Furthermore, the caregivers of ERS often raised ques- 
tions when reading and speaking with their children that stimulated and further 
expanded their children’s knowledge base. The test situations were always low risk, 
low stress, as well as game activities. Hence learning remained a fun-filled, enjoy- 
able activity. 


Implications and Recommendations 

Within a structured learning environment, teachers can apply some of the 
strategies employed by the caregivers of ERS. As Durkin (1978) and Clay (1980) 
have suggested, teachers should read to and listen to children. Furthermore, adults 
should be reminded to speak with, as opposed to at, children. Teachers can and 
should model reading, not only as a leisure activity but also as a means by which 
questions about the world can be answered. Strategies and steps for seeking infor- 
mation need to be discussed as they are modeled. Rather than suggesting 
additional information may be found in a reference book, teachers should model 
and discuss not only where this information may be found, but also how new infor- 
mation relates to what the child already knows. 

Just as the ERS’ homes were child-centered, so too should be the classroom 
learning environment. There should be opportunities for self-initiated and self-di- 
rected learning from highly interesting materials. 

The homes of ERS allowed for and encouraged practice activities that were 
highly motivational, game-like, manipulative, and social in nature. Inherent in the 
practice activities of the ERS was the assumption that if learning is occurring, 
errors and mistakes are acceptable, if mot necessary. Risk taking and 
nonthreatening “testing” of newly acquired learning was, at the very least, encour- 
aged and at the most, demanded by the caregivers of ERS. Expectations on the 
part of the caregivers were always positive in nature; ERS were viewed as children 
who could and would learn. As McCracken and McCracken (1985) suggest, the use 
of practice activities where errors are acceptable, can and should be implemented 


into classrooms. 
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Suggestions for Further Research 


Major gaps in our knowledge base exist with respect to the everyday social in- 
teraction of ERS and their caregivers. Since the families of several ERS contained 
multiple ERS, the younger siblings of identified ERS provide a population which 
may shed additional light in this area. 


Secondly, ERS have been termed “supercurious,” “self-motivated,” and “self-di- 
rected” learners. The caregivers in this study restate that they, like parents in pre- 
vious studies of ERS, began direct and indirect instruction at the request of their 
children. An explanation ought to be sought about what initiated this intrinsic 
motivation or curiosity in ERS, and why it is either not found or not initiated in all 
children. 


Conclusion 


In summary, the environment of ERS appears to have been conducive to learn- 
ing. Caregivers of ERS described their children as truly active learners, having a 
sense of what they needed to do in order to facilitate their own understanding of 
reading and learning in general. These caregivers provided an environment that 
was conducive to and highly supportive of asking questions for clarification; an en- 
vironment with caregivers who modeled and reinforced the development of 
strategies for completing gaps in knowledge; and an environment in which fearless 
practice and self-monitoring of comprehension was facilitated. The environment 
created by these caregivers encourages concept development, skill development, 
and the development of criteria for guiding learning. According to the caregivers of 
ERS, they taught their children not how to read, but how to learn. The question of 
who or what initially triggered the ER’s first interactions with his or her environ- 
ment, and whether this interaction was due to the nature of the child or to the na- 
ture of the environment, remains unanswered. 
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This study investigated the relationships between four perceived role related 
stressors (role conflict, role ambiguity, role overload, and powerlessness), two affec- 
tive reactions to work (emotional exhaustion and job satisfaction), and thrust leader- 
ship behavior among high school principals. Principals from 74 New Jersey high 
schools completed surveys containing measures of role stressors and responses to 
work. A random sample of teachers in each school reported on the principal’s thrust 
behavior. All four stressors: were significantly correlated with the two affective 
responses to work. However, the stressors explained more variance in emotional ex- 
haustion (R2=.65) than in job satisfaction (R2=.39). Neither the stressors nor the af- 
fective responses were related to principal thrust except for the age range of 40-46. 


Over the previous decades, educators have been beset with a multitude of prob- 
lems. However, no problem may prove more debilitating than one confronting them 
now: job dissatisfaction, emotional and physical exhaustion, and the many other 
psychological, physical, and behavioral manifestations related to job stress or 
“burnout.” Yet while stress and burnout have become established topics in educa- 
tional research in recent years, teachers have received more attention than admin- 
istrators. In a recent and extensive review of Canadian research on teacher stress, 
for example, Hiebert (1985) included a short “Note on School Administrators,” 
which referenced only three empirical studies. The purpose of this study was to 
add to the small stock of conceptually based empirical research on stress among 
school administrators; the focal role was that of the high school principal. 
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Not only are there few studies of stress among principals, but the results of ex- 
isting studies conflict. Especially problematic is the level of stress reported among 
principals. On the one hand, there are reports of considerable stress. Lusthnus 
(1975), for example, found that principals in Quebec experienced more pressure in 
their jobs than teachers or other professional groups. Gmelch, Koch, Swent, and 
Tung (1982) conducted a survey of all Oregon school administrators from vice prin- 
cipal through superintendent. “More than 60% reported that at least 70% of their 
total life stress resulted from their jobs” (p. 11). Swent and Gmelch (1977) found 
administrators reported 75% of their total life stress came from their jobs. 
Syrotuik and D’Arcy (1983), in a study of administrators from Saskatoon, reported 
that role conflict was the most important source of stress. On the other hand, 
MacPherson’s (1985) study of principals in Nova Scotia showed low levels of role 
conflict, role ambiguity, and emotional exhaustion and mild to moderate levels of 
overall job stress. In an Alberta study, Jankovic (1983) reported only mild to mod- 
erate levels of stress among 88% of principals sampled. These conflicting results 
among the small number of existing studies are probably related to the differing 
conceptual frameworks and operational definitions used in the various inquiries. 


Given the mixed results to date, several reasons remain for pursuing studies of 
principal job-related stress and its effects. First, there is a concern for the individ- 
ual occupying the role. It is important, both theoretically and practically, to under- 
stand the types and amounts of stress inherent in the school leadership role and 
what the effects of these are likely to be for the individual. The rationale for this 
concern is self-evident, and all the empirical studies of administrator stress refer- 
enced in this study address aspects of this issue. Second, because of the principal’s 
hierarchical position, stress is likely to have an organizational dimension as well as 
a strictly individual one. Caldwell and Forney (1982), for example, note that “the 
principal has been considered as the most influential factor in determining the cli- 
mate, direction, and leadership of a school” (p. 1). Erez and Goldstein (1981), in a 
study of Israeli principals, argued that experienced stress is a major factor affecting 
principal leadership efficiency. Thus, because of the principal’s central position in 
the school’s social system, debilitating effects of stress on the individual may be 
translated into larger negative effects for the whole school. 


Problem 


This study investigated the relationships between perceived role related job 
stressors, internal affective responses to work, and a leadership behavior among 
high school principals. Three organizational factors (role conflict, role ambiguity, 
and role overload) and a dimension of alienation (powerlessness) were examined to 
determine their relationships with two affective responses to work: job satisfaction 
and emotional exhaustion, a component of Maslach and Jackson’s (1978) burnout 
syndrome. Moreover, these four job stressors and two affective conditions were 
analyzed to determine their relationships with a leadership behavior presumed to 
have organizational consequences, the thrust behavior of the principal. Thrust is 
the principal’s attempt to motivate teachers through setting a personal example. 
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Theoretical Framework 


Of the four job related stressors examined in this study, three have their con- 
ceptual roots in role theory. Role conflict refers to a situation in which an individ- 
ual perceives incompatible work demands. Kahn, Wolfe, Quinn, and Snoek (1964) 
reported that role conflict was related to various consequences of job stress includ- 
ing job dissatisfaction, high job-related tension, greater sense of futility, and lower 
self-esteem. 


Role ambiguity exists when a person perceives inadequate information about 
expectations for his or her work. It has been linked to job dissatisfaction, poor pro- 
duction, and anxiety (Kahn, Wolfe, Quinn, & Snoek, 1964), increased blood pres- 
sure and pulse rate (French & Caplan, 1973), depression and resentment (Caplan & 
Jones, 1975), lower self-esteem and life dissatisfaction (Margolis, Kroes, & Quinn, 
1974). 


Role overload refers to a perception of more work than a person can complete 
in a given time. Kraut (1965) and Mueller (1965) reported a variety of negative re- 
actions in individuals exposed to overload: tension, low job satisfaction, poor 
interpersonal skills, and low self-esteem. Moreover, French and Caplan (1973) 
found it to be related to absenteeism, low motivation to work, and an absence of 
suggestions to employers. 


The fourth precursor of job stress examined in this study is one of the five di- 
mensions of alienation conceptualized by Seeman (1959). Powerlessness is the per- 
ception that an individual cannot control the outcomes he or she seeks. It has been 
linked to job dissatisfaction (Shepard, 1972), poor job performance and tardiness 
(Cummings & Manring, 1977), lower self-esteem and increased anxiety (Gilbert & 
Mangelsdorff, 1979). 


Emotional exhaustion and job satisfaction were the two affective responses to 
work examined in this study. Emotional exhaustion is the feeling that one’s emo- 
tional resources are depleted; that one is no longer able to give to others on the 
psychological level. Associated with emotional exhaustion is a sense of physical ex- 
haustion and a lack of any positive feeling, sympathy, or respect a professional 
would normally show for clients. It is a subset of an overall burnout index devel- 
oped by Maslach and Jackson (1978). Researchers have found positive relationships 
between emotional exhaustion and the following: low morale, high absenteeism, 
and turnover (Maslach & Jackson), cynicism, negativism, inflexibility, and a sense 
of helplessness and hopelessness about one’s clients (Freudenberger, 1977). 


Job satisfaction is the affective evaluation an individual experiences toward a 
job. It has been negatively related to the following outcomes: absenteeism (Porter 
& Steers, 1973), turnover (Waters & Roach, 1971), early retirement (Vroom, 1964), 
tardiness and grievances in organizations (Mirvis & Lawler, 1977), and poor mental 
health (Kornhauser, 1965). Friesen, Holdaway, and Rice (1984) noted that most 
studies of job satisfaction have focused on production workers rather than adminis- 
trators. They argued the need for more studies of administrators and specifically 
suggested investigating the relationship between job stress and dissatisfaction. 


The general conceptual link between role based stressors in the work place and 
negative affective responses to work was established and first empirically tested by 
Kahn et al. (1964). More recently these relationships have been confirmed in 
studies of several groups of educators. Role conflict and ambiguity have been corre- 
lated with perceived emotional exhaustion among several general teacher samples 
(Schwab & Iwanicki, 1982; Beck & Gargiulo, 1983), special education teachers 
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(Beck & Garguilo, 1983), and high school department chairpersons (Kottkamp & 
Mansfield, 1985). Role conflict, but not role ambiguity, has been related to emo- 
tional exhaustion among principals (MacPherson, 1985). Perceived powerlessness 
was also correlated with emotional exhaustion among secondary department chair- 
persons (Kottkamp & Mansfield, 1985). Role conflict and ambiguity were corre- 
lated with job dissatisfaction and job-related tension among multidisciplinary team 
administrators (Kottkamp & Helfman, 1985). Role overload, a separate concept but 
related to role conflict and ambiguity, has yet to be empirically tested as a stressor 
among educators. However, it is reasonable to assume that it is related to affective 
responses to work in ways which parallel those of the other three stressors for 
which evidence already exists. 


Taken together, this conceptual and empirical background leads to two 
hypotheses: 
H, There are negative relationships between the perceived role conflict, 
role ambiguity, role overload, and powerlessness of the high school 
principal and his or her job satisfaction. 


H. There are positive relationships between the perceived role conflict, 
role ambiguity, role overload, and powerlessness of the high school 
principal and his or her emotional exhaustion. 


The first two hypotheses were designed to test relationships between role 
stressors and two affective responses: job satisfaction, and emotional exhaustion, at 
the individual level. However, there was also interest in pursuing an independently 
measured behavioral outcome of the stressors and the two affective responses, a be- 
havioral outcome which would be likely to effect the organization as well as the in- 
dividual. 

The literature indicated that the job stressors and affective responses in this 
study have been related to various behavioral outcomes. Role conflict and ambigu- 
ity have been associated with decreased productivity (Kahn et al., 1964). Role 
overload has been correlated with poor interpersonal relations (Kraut, 1965; 
Mueller, 1965), and high absenteeism (French & Caplan, 1973). Powerlessness has 
been related to poor performance and increased tardiness (Cummings & Manring, 
1977). In a similar vein, negative affective responses to work have been related to 
the following negative behaviors. Job dissatisfaction has been correlated with in- 
creased absenteeism and tardiness (Mirvis & Lawler, 1977; Waters & Roach, 1971), 
while emotional exhaustion has also been associated with higher absenteeism, job 
turnover (Maslach & Jackson, 1978), and cynicism and negativism toward clients 
(Freudenberger, 1977). 

A search was carried out to identify a principal behavior with organizational 
consequences, a behavior which could also be related to the stressors and affective 
responses of job satisfaction and emotional exhaustion. The leadership behavior, 
thrust, was chosen. Halpin and Croft (1962) defined thrust as a dynamic behavior 
of the principal characterized by efforts to “move the organization.” Though 
“starkly task-oriented,” this behavior is not marked by close supervision. Rather, 
the principal asks no more of teachers than he or she is willing to give and leads 
through active personal example and modeling of desired behavior. 

High thrust behavior is an essential component of open school climate as 
conceptualized by Halpin and Croft (1962). Climate openness has, in turn, been as- 
sociated with a number of other variables typically considered to be desirable: 
higher teacher satisfaction (Schwandt, 1978) and loyalty to the principal (Kanner, 
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1974); greater teacher confidence in their own and the school’s effectiveness 
(Andrews, 1965); higher humanistic Pupil Control Ideology scores (Appleberry & 
Hoy, 1969); greater innovativeness (Thomas, 1976); and lower student alienation, 
higher student morale, and lower dropout rates (Hartley & Hoy, 1972; Mullins, 
1976). Because scholars have argued that the principal is the key figure in setting 
the school climate (Drummond, 1970; Fox, 1973; Hoy & Henderson, 1983) and be- 
cause the correlates of open climate are typically considered positive, the principal 
with high thrust behavior is likely to have a positive effect on his or her school 
while the principal with low thrust behavior is likely to have a negative effect. 


Most of the individual outcomes of role stressors and their negative behavioral 
responses such as decreased productivity, absenteeism, tardiness, job turnover, and 
negativism towards clients may be categorized as withdrawal behaviors. These be- 
haviors indicate either a physical or a psychological withdrawal from central ele- 
ments of effective work performance. Quite the opposite of withdrawal from work 
is thrust, a leadership behavior indicated by such activities as high energy expendi- 
ture and modeling, a leadership behavior which has beneficial consequences at the 
organizational level. Thus, if role stressors and their negative affective responses 
have been associated with various withdrawal behaviors in other work settings, it is 
logical to predict that stressors and their negative responses will be related to re- 
duced energies put into the work of the principalship. This reasoning led to the 
hypothesis: 


H, Perceived role conflict, role ambiguity, role overload, powerlessness, 
job dissatisfaction, and emotional exhausion are negatively related to 
the principal’s thrust behavior. 


Method 
Sample 


Data for the study were collected by means of questionnaires distributed to the 
principals and teachers in 74 public high schools in New Jersey. Although partici- 
pation was voluntary, an effort was made to obtain a reasonably representative 
sample of high schools in the state. Sample schools had student populations rang- 
ing from 250 to 2000, were drawn from 17 of 22 New Jersey counties, and were 
drawn from urban, suburban, and rural areas. The sample schools also covered the 
whole range of 10 District Factor Groups (DFGs) established by the New Jersey 
State Department of Education. DFGs are computed as a weighted composite of 
district averages of seven socioeconomic and demographic factors: educational and 
occupational background, per capita income, poverty and unemployment rate, pop- 
ulation density and mobility, and degree of urbanization. The sample schools were 
skewed slightly toward higher DFG ratings or higher socioeconomic status when 
compared to the state average. 


The principals completed scales to measure the four job stressors and two affec- 
tive responses to work. The data collected for this study were part of a set of re- 
lated studies of high schools which in total required four independent teacher 
questionnaires. Therefore, a randomly selected one-fourth of the teaching faculty 
of each school was administered a scale to measure the thrust behavior of the prin- 
cipal. Because teacher surveys were completed in faculty meetings, usable return 
rates exceeded 90%. 


The unit of analysis was the individual high school principal in the context of 
his or her particular school and district. Teacher responses on the thrust scale were 
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aggregated to form a school level perceptual score. The data were analyzed through 
correlation and regression analyses. Secondary analyses included t-tests and Fisher 
Z transformations. 


Instrumentation 


Role Conflict, Ambiguity, and Overload. Role conflict, ambiguity, and overload 
were measured by a 15-item instrument (5 items for each role stressor) refined by 
Abdel-Halim (1978) from the previous work of Beehr (1976) and Rizzo, House, and 
Lirtzman (1970). In Abdel-Halim’s instrument, each respondent is asked to indi- 
cate on a 7-point scale ranging from “very false” to “very true,” the extent to which 
the items described his or her work situation. For these scales and the measures of 
powerlessness, job satisfaction, and emotional exhaustion, a consistent set of 
6-point response scales with poles of “strongly agree” and “strongly disagree” were 
used. This modification was made so that all six measures could be intermixed in 
the survey to reduce response bias. 


Construct validity of the instrument was established by Beehr (1976) and Rizzo, 
House, and Lirtzman (1970). Reliability coefficients of .76 (conflict), .69 (ambigu- 
ity), and .74 (overload) were reported by Abdel-Halim (1978) using the 
Spearman-Brown formula. In this study the alpha coefficients were .70 (conflict), 
.62 (ambiguity), and .81 (overload). 


Powerlessness. Powerlessness was measured by a 12-item subscale of an instru- 
ment developed by Zeilinski and Hoy (1983). The subscale measures perceived 
powerlessness in the school as a whole or what the researchers called 
Organizational Effect. Construct validity for the instrument was assessed by 
Zeilinski and Hoy. High reliability was reported for the 12 items, an alpha coeffi- 
cient of .92. The data in this study generated an alpha coefficient of .88. 

Emotional Exhaustion. This concept was measured through one subscale of the 
Maslach Burnout Inventory (MBI) developed by Maslach and Jackson (1978). The 
MBI is a measure of perceived burnout along three dimensions: emotional exhaus- 
tion, depersonalization, and personal accomplishment. The original instrument 
consists of 22 items which are rated twice, once for frequency and once for inten- 
sity. For this study, only the 9-item emotional exhaustion subscale was used. 
Iwanicki and Schwab (1981) had suggested collapsing the frequency and intensity 
responses into a single response category because of the high correlations (.89 for 
emotional exhaustion) between the two when used in a study of teachers. Conse- 
quently, a single 6-point “strongly agree” to “strongly disagree” scale was used in 
this study. 

Maslach and Jackson (1979) reported alpha reliability coefficients for the emo- 
tional exhaustion subscale of .89 (frequency) and .87 (intensity). In this study the 
alpha coefficient for the modified emotional exhaustion scale was .88. Several in- 
vestigators (Iwanicki & Schwab, 1981; Maslach & Jackson, 1979) have demon- 
strated the validity of the MBI through concurrent, discriminant, and construct 
methods. 

Job Satisfaction. The concept of job satisfaction was measured by the 6-item 
Index of Satisfaction developed by Miskel, Fevurly, and Stewart (1979). They pro- 
vided evidence of content validity for the instrument and reported an alpha coeffi- 
cient of .78. The alpha in this study was .71. 

Thrust Behavior. This principal behavior was measured using a scale similar to 
the thrust subscale of the Organizational Climate Description Questionnaire 
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(OCDQ) developed by Halpin and Croft (1962). As part of the work to revise the 
OCDQ for secondary school use (Kottkamp, Mulhern, & Hoy, 1985) the original 
thrust subscale was modified by adding two items to the original 9-item scale. The 
new 11 item thrust subscale asks respondents to reply to statements about the 
principal’s behavior on a 4-point scale ranging from “rarely occurs” to “very fre- 
quently occurs.” The alpha coefficient for the revised thrust subscale reached .93 in 
this study. 


Results 
Descriptive Statistics 


The scale and item means, standard deviations, and zero order correlations for 
the variables in the study are presented in Table 1. All principal self-report vari- 
ables were measured on 6-point scales. The conceptual mean for a 6-point scale is 
3.5. As may be seen in Table 1, mean item responses of principals to job satisfac- 
tion (4.56) were comparatively high, while both role conflict (3.46) and role 
overload (3.49) hovered right at the conceptual mean. Emotional exhaustion (2.26) 
was comparatively low at more than one point below the conceptual mean, and 
powerlessness (1.88) showed the lowest mean item response of all the variables. 
Principal thrust was reported by teachers on a 4-point scale with a conceptual 
mean of 2.5. The mean item response (1.45) indicated that teachers perceived rela- 
tively low thrust from their principals. 


TABLE 1 


Means, Standard Deviations, and Intercorrelations 


Variable 1 2 3} 4 5 6 Item Scale Scale 
Mean Mean SD 


1. Role Conflict 3.46 17.30 5.07 
2. Role Ambiguity aor 2.24 11732 3.31 
3. Role Overload .46* .40* 3.49 17.47 0.00 
4. Powerlessness .44* .50* pods 1.88 22.55 8.26 
5. Job Satisfaction Se PNR eA se late SS 8h 4.56 27.34 5.13 
6. Emotional rode .40* .68* Woe eee Ol 2.26 20.38 8.22 
Exhaustion 

7. Thrust Behavior = ln! =) — Ys} 01 120i 1.45 15.98 2.28 
*p<.01 


Several patterns are evident among the correlations. First, the three role per- 
ception variables and perceived powerlessness are all related at moderate levels, 
with correlations ranging from .36 (conflict with ambiguity) to .57 (overload with 
powerlessness). Thus job stressors in this study share proportions of variance rang- 
ing from one-eighth to one-third. These proportions of shared variance indicate 
both relatedness and relative independence, as has been found in previous studies 
(Abdel-Halim, 1978; Rizzo, House, & Lirtzman, 1970). Clearly, the job stressors are 
not tautological, but their relatedness is likely to result in some collinearity when 
the affective and behavioral variables are regressed on them. 
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The second pattern is that all job stressors are negatively correlated with job 
satisfaction and positively correlated with emotional exhaustion at statistically sig- 
nificant levels. These relations range from .40 (ambiguity with emotional exhaus- 
tion) to .72 (powerlessness with emotional exhaustion). These relationships suggest 
support for Hypotheses One and Two. However, a third pattern is that none of the 
job stressors, job satisfaction, or emotional exhaustion have significant relation- 
ships with thrust behavior. These correlations suggest the rejection of Hypothesis 
Three. 


Hypothesis Testing 


To gain a more thorough understanding of the individual and combined effects 
of the variables in the hypothesized relationships, simultaneous inclusion, multiple 
linear regression analyses were conducted. Table 2 reports the results of regressing 
satisfaction on the four job stressors, a test of Hypothesis One. The total equation 
reaches significance, indicating that taken together the four independent variables 
do predict job dissatisfaction. Further, the adjusted R? indicates that slightly over 
one-third of the variance in satisfaction is related to the variance in the four job 
stressors. Scrutiny of the betas and their significance level indicates, however, that 
only role conflict and powerlessness are individually significant predictors of job 
satisfaction when all four independent variables are simultaneously related to the 
dependent variable. Hypothesis One receives support, but the regression analysis 
clarifies the relative strength of the relationships between the job stressors and sat- 
isfaction when the stressors are considered jointly. 


TABLE 2 


Summary for the Regression of Perceived Principal Job Satisfaction on Four Perceived 
Job Stressors 


Independent b beta Standard t Adjusted 
Variables error beta Naas 
Role Conflict = a ey 10 = 206" 
Role Ambiguity Paul =14 16 cob ieet | 
Role Overload 06 —.06 plak =55 
Powerlessness =. 24 arate’ 07 ae 
Total Equation F=12.42** 385 

#5 <=.05 

sPahar Ny 


The results of regressing emotional exhaustion on the four job stressors are pre- 
sented in Table 3. Again, the total equation reaches significance, but the adjusted 
R? of .65 indicates that the four independent variables explain two-thirds of the 
variance in emotional exhaustion. The four job stressors, thus, explain twice as 
much variance in emotional exhaustion as in job satisfaction. The betas indicate 
that role conflict, overload, and powerlessness are individually significant predic- 
tors of exhaustion. Hypothesis Two also receives solid support. 
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TABLE 3 


Summary for the Regression of Perceived Principal Emotional Exhaustion on Four 
Perceived Job Stressors 


Independent b beta Standard é Adjusted 
Variables error beta R? 
Role Conflict .36 22 a3 Dare 
Role Ambiguity aug —.04 .20 —=,43 
Role Overload 49 ao a5} aoe 
Powerlessness ran 46 .09 ALOR ts 
Total Equation F=34.42** .647 
#4 <,01 


As could be predicted from the correlations in Table 1, the regression of thrust 
behavior on the four job stressors, job satisfaction, and emotional satisfaction did 
not prove significant. The results of this analysis are shown in Table 4. As may be 
seen, neither the equation nor any individual independent variable explains a sig- 
nificant amount of variance in thrust. Hypothesis Three is soundly rejected. 


TABLE 4 


Summary for the Regression of Principal Thrust Behavior on Four Perceived Job 
Stressors and Two Affective Responses 


Independent b beta Standard t Adjusted 
Variables error beta 1 

Role Conflict = (06 ais 07 — .89 

Role Ambiguity 209 aa | 10 ca afl 

Role Overload — (4 109 0p! =250 
Powerlessness O1 05 .05 26 

Job Satisfaction 05 212 07 = hs 

Emotional Exhaustion .00 02 .06 .02 

Total Equation F=,.522 .041 


Secondary Analyses 


Secondary analyses were conducted in order to investigate the failure of 
Hypothesis Three. These analyses were guided by two issues. First, age has been 
related to differences in perceived stress, but these findings are complex with some 
studies reporting low stress with advancing age (Indik, Seashore, & Slesinger, 1964; 
McGrath, 1970), while in boundary spanning positions (Tung & Koch, 1980) and 
among higher level executives, perceived stress increases with age (Pincherle, 1972). 
Second, the adult developmental literature indicates that significant stress related 
changes, often referred to as mid-life crisis, occur in the first half of the fourth dec- 
ade for males (Gould, 1972; Levinson, 1974; Lidz, 1980). 
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When age was correlated directly with the seven variables in Hypothesis Three, 
all correlations were close to zero. Apparently, there are no linear relationships be- 
tween age and the variables of interest in this study. The next analysis was based 
on the adult development literature. Principals were placed in two age related 
groups: 40-46 years of age (N=27) and 47 years of age and above (N=40). Those 
39 or under (N=6) were dropped from the analysis because the group was too 
small to allow for meaningful analysis. T-tests for mean differences between the 
two age groups were computed across all seven variables in Hypothesis Three. 
Neither the role stressors, affective responses, nor thrust behavior showed signifi- 
cant mean differences across these age groups. 


Finally, age was analyzed as a “moderator variable” (Zedeck, 1971; Arnold, 
1982), intervening in the relationships specified in Hypothesis Three. Correlations 
were computed between variables of interest within each age group and then com- 
pared across age groups for significant differences using Fisher’s Z transformation. 
The question answered was: Does the degree of the correlation (e.g., between thrust 
and role conflict) differ significantly between the two age groups? The correlations 
and the results of the Fisher Z transformations are shown in Table 5. The degree of 
the relationship across the two age groups was found significant in five of the six 
relationships, satisfaction-thrust being the only exception. Apparently, although 
age was not directly related to perceptions of the various stressors or outcomes 
themselves, it did, in the constructed age categories, moderate the degree of rela- 
tionships between the stressors, emotional exhaustion, and the behavioral outcome, 
thrust. Scatter plots for each age group were examined to see whether outliers were 
responsible for spurious correlations. This did not appear to be the case. 


TABLE 5 


Moderating Effects of Age Groupings on the Correlations Between Role Stressors, Af- 
fective Responses, and Thrust Behavior of Principals 


Principals Principals 
Variables Correlated 40-46 Years 47 Years and Over 
(n= 27) (n=40) 
Correlation Z Score 


Coefficients 


RG] THRUSTS —.54 —.03 2.20* 
RA—THRUST Eo 08 9.44* 
RO= TARwSH — 44 16 tales 
PO—THRUST =f) 18 ieee 
SA—THRUST 31 —14 L717 
EE—THRUST —.45 15 2.45% 


§RC=Role Conflict, RA=Role Ambiguity, RO=Role Overload, PO= Powerlessness, SA=Satisfaction, 
EE= Emotional Exhaustion 

*n<.05 
*#n<.01 
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Discussion 


The results of this study are in line with those of MacPherson (1985) and 
Jankovic (1983) in demonstrating that principals are not experiencing high levels of 
stress, in this case emotional exhaustion. In fact, although comparison is crude be- 
cause of the modified version of the Maslach Burnout Inventory used in this study, 
the high school principals showed a slightly lower mean level of emotional exhaus- 
tion than any of the four separate samples of teachers compared by Kottkamp and 
Mansfield (1985). Further, the sample principals responded with a moderately high 
mean level of job satisfaction, an affective work response possessing a negative cor- 
relation (—.61) with emotional exhaustion. These are encouraging findings to re- 
port to principals and to those who aspire to the position. 


The findings for Hypotheses One and Two, as predicted, replicate a generally 
consistent pattern of role perception-affective response relationships found in both 
educational and noneducational settings. Each role related perceptual variable (role 
conflict, ambiguity, overload, and powerlessness) showed a significant zero order 
correlation of moderate strength with each affective work response (job satisfaction 
and emotional exhaustion). However, because the correlations among the role per- 
ception variables were roughly of the same order as the correlations between these 
variables and the affective responses, multicollinearity was a fact that affected the 
results of the regression equations. The total equations were significant, but not all 
role variables showed individual significance in the equations when all four role 
perceptions were considered simultaneously. 


MacPherson’s (1985) study of elementary and secondary principals in Nova 
Scotia makes a useful comparison for discussing patterns of relationships between 
the role variables and emotional exhaustion. In the present study, role overload, 
powerlessness, and role conflict, in descending order of explanatory power, all 
showed individually significant relationships with emotional exhaustion in the re- 
gression equation. Among MacPherson’s principals, role conflict was also signifi- 
cantly related to emotional exhaustion, while role ambiguity was not. Perceived 
degree of autonomy described as “freedom to operate their schools as they saw fit” 
(p. 3) was also negatively related to exhaustion. Autonomy, so defined, seems a 
polar opposite of powerlessness; thus the resulting patterns in both principal 
studies appear quite similar along two dimensions. MacPherson’s study contained 
no variable comparable to role overload. 


The principal occupies a boundary spanning position. Occupants of such posi- 
tions are susceptible to numerous conflicting demands from both internal and ex- 
ternal sources. These conflicting demands were related to principal emotional 
exhaustion. To be an effective leader of a school and its social system, the principal 
also needs to hold clear expectations for the role. Principals seemed to find role 
ambiguity less of a problem in their leadership role and less stress evoking than 
conflict. The dual position of leader and boundary spanner generates much work. 
Most emotionally exhausting to principals in this study was the volume of work 
generated by the role. As a source of exhaustion, role overload was followed closely 
by a sense of powerlessness to control events in the school, though the mean level 
of this sense was quite low among the principals in the sample. 

Remembering that the level of emotional exhaustion experienced by principals 
was not high, the four role related variables did explain a high proportion—almost 
two-thirds—of the variance in emotional exhaustion. With this level of prediction, 
the results of this study have practical as well as theoretical implications. Role 
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stressors were not only theoretically related to emotional exhaustion. By contrast, 
the same four variables explained only half as much variance in job satisfaction. 
This proportion of variance is still noteworthy. These findings, along with the cor- 
relation of —.61 between satisfaction and emotional exhaustion, also provide a first 
response to Friesen, Holdaway, and Rice’s (1984) call for an examination of the re- 
lationship between perceived dissatisfaction and stress. 


Reducing the negative aspects of role is a goal which would likely bring actual 
and desirable personal outcomes for high school principals. Such reduction would 
be more likely to reduce pain than to bring joy. It is, however, unrealistic to suggest 
that role conflict, ambiguity, overload, and powerlessness can be drastically re- 
duced in the principalship. They come, to some degree, with the proverbial terri- 
tory. 

The failure of Hypothesis Three remains an enigma. Puzzling indeed is the 
finding that neither role perceptions nor affective responses were generally related 
to the thrust variable and that age ranges moderated the relationships such that 
only 40-46-year-old principals demonstrated the predicted relationships. There are 
only questions, not answers. For those interested in the age group findings, further 
discussion is available elsewhere (Travlos, 1984). 


Several other issues may be related to the failure of Hypothesis Three. First, 
the operation for thrust may not be a sensitive measure of principal behavior. Not 
only was the mean for thrust quite low, but the standard deviation indicated little 
variation among schools. Second, survey research bears the liability of potential re- 
sponse bias when variables are not measured independently. It is possible that the 
intercorrelations of the principal self-report variables were inflated because of 
method bias. If this were the case, then the correlations with thrust were lower, 
since it was the only independently measured variable. 


A note on the measurement of role ambiguity may also be helpful to future re- 
searchers. The alpha coefficient for ambiguity reached only .62 in this study, 
indicating a good deal of error variance in the measurement and attenuation of the 
correlations beween this and other variables. A decision was made to use the 
shortest scales with reasonable reliabilities in order to reduce the length of the 
survey—since surveys do contribute to role overload for principals! Alphas in past 
studies have averaged .70 for the Abdel-Halim (1978) scale. Researchers might be 
advised to trade off the overload cost of four additional questions and use the 
longer Rizzo, House, and Lirtzman (1970) scales for role conflict and ambiguity in 
order to increase reliability of measurement. 

Relationships between role overload, powerlessness, and affective responses to 
work appear especially worthy of more investigation in educational administration. 
The vast majority of research involving role perceptions in all vocational areas has 
concentrated on role conflict and ambiguity rather than other role perceptions. In 
education only one prior study (Kottkamp & Mansfield, 1985) has investigated the 
powerlessness-emotional exhaustion relationship, and we know of no prior study 
that has investigated the role overload-emotional exhaustion relationship. The 
magnitude of the zero order correlations suggests that further study of these vari- 
ables should be fruitful in understanding both occupational stress and job satisfac- 
tion among educational administrators. 

The basic issue of examining leaders for potential relationships between per- 
ceived role stressors, affective responses, and behavioral responses with organiza- 
tional consequences still seems worthy of pursuit. Most existing studies probe role 
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perception-affective response relationships rather than role perception-behavioral 
response relationships, and most of the research which does consider behavioral 
outcomes was conducted with workers at the lower levels of manufacturing firms. It 
is an important issue to understand the extent to which perceptions of stressors 
and affective responses to them are related to leadership behavior. Pursuit of this 
issue will require care in the choice of behaviors to be assessed and creativity in 
structuring more independence in measurement. 


Note 
ile This article is based on a revision of a paper presented at the Annual Meeting of the American Educational Research Asso- 
ciation, New Orleans, 1984. 
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PERSPECTIVES 


Making Research Matter 


Edward A. Holdaway 


University of Alberta 


In many countries researchers have expressed concern over the perceived lack of im- 
pact of their findings upon social policies. The relationship between research and pol- 
icy has been the subject of several recent major conferences and publications. Many 
explanations have been presented together with related suggestions for increasing im- 
pact. However, the complexity of the relationship and the need to distinguish be- 
tween different types of research and different types and levels of policy have often 
been overlooked. Indirect long-term impact is probably the most important research 
contribution. The ways in which social science research contributes to knowledge re- 
quire more examination. 


Since the mid-1960s, and especially during the last decade, many writers have 
focused their attention on the impact of research upon social policy. This arose 
partly from a concern that a great deal of research, involving considerable effort 
and expense, appeared to have little effect upon policy, and partly from the need to 
show practical benefits from research in order to defend research expenditures. 
The purposes of this article are to present evidence that this concern persists inter- 
nationally, to reassess the reasons why direct impact continues to be less than ex- 
pected, to suggest an attitude which independent researchers should adopt, and to 
propose some approaches which could increase the impact of policy-relevant re- 
search. Selected recent literature, personal experiences, and interviews conducted 
by the author in several countries are used as sources. 

Throughout this article, “research” is viewed as “systematic critical investigation 
directed toward increasing the general body of knowledge or understanding of the 
subject studied” (Radford & McDowell, 1973, p. 11). As Nisbet and Broadfoot 
(1980, p. 42) have emphasized, research obviously includes a very wide range of 
activities and methodologies. Some writers, including the author, categorize 
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“evaluation” as a type of research, whereas others classify evaluation and research 
as separate activities. “Policy” is viewed as either official statements or unwritten 
understandings which direct or guide behavior. 


Persistence of Widespread Concern and Interest 


Evidence of increasing attention to the impact of research upon policy is pro- 
vided by the following publications: (a) books prepared by Nisbet, Megarry, and 
Nisbet (1985), Husén and Kogan (1984), Bulmer (1982), Adams, Smelser, and 
Treiman (1982), and Nisbet and Broadfoot (1980); (b) special articles in Adminis- 
trative Science Quarterly in 1982; and (c) the 1981 Special Issue of the 
International Review of Education. Note must also be taken of the important 
works of Coleman (1972), Taylor (1973), Caplan, Morrison, and Stambaugh (1975), 
Weiss (1977), Suppes (1978), and Mitchell (1980). The topic has received attention 
from various organizations including the US National Academy of Sciences, the So- 
cial Science Research Council of Great Britain (recently renamed the Economic 
and Social Research Council), the Australian Council for Educational Research, the 
Foundation for Educational Research in the Netherlands (SVO), the Council of 
Europe, and the US National Research Council. As Lindblom and Cohen (1979, 
p. 1) have stated, “In public policy making, many suppliers and users of social re- 
search are dissatisfied, the former because they are not listened to, the latter be- 
cause they do not hear much they want to listen to.” Information and opinions 
garnered from Canadian and other sources attest to the prevalence of this condi- 
tion. 


In a Science Council of Canada study which began in 1978, Karen Fish (1983) 
conducted 40 interviews with Members of Parliament (Ottawa), Senators, parlia- 
mentary staff members, and members of scientific organizations. She concluded 
that “scientific and technological factors are peripheral to the economic, social and 
political concerns determining parliamentary decisions” (p. 9). This study was un- 
dertaken “to assess government’s ability to respond to ‘the complex and divisive is- 
sues that arise from the interaction of science and human values,’,” (p. 5). The 
associated difficulties in policy making are exemplified by the commonly expressed 
views that more research should be conducted before action is taken to reduce the 
environmental effects of acid rain. Another important, recent example at the fed- 
eral level in Canada is provided by the decisions of the Department of Justice and 
Transport Canada to obtain “scientifically accurate data that will help them make 
informed decisions about the areas for which they have responsibility,” including 
action to reduce the severe consequences of accidents caused by drinking drivers 
(Supply and Services, 1985, p. 11). 


The Canadian social science community has been actively involved in the 
research-policy debate as evidenced by articles and reports by Breton (1977), 
Deiseach (1978), the Social Science Federation of Canada (1983), and Coleman and 
LaRocque (1983). These have mainly addressed the contributions of research and 
the reasons why such contributions are often less than expected. On the other 
hand, questions are frequently raised about the apparent lack of relevance of much 
Canadian social science research. Andre Fortier (1978, pp. 11-12), former SSHRCC 
President, observed that “Many MPs are worried about taxpayers’ money being 
spent on projects which do not have any bearing on Canadian problems. Journalists 
increase this concern by writing sensational articles.” This perception was shared 
by John Trent (1982, p. 2), formerly Executive Director of the Social Science 
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Federation of Canada: 


Few Canadian social scientists recognize that we are also criticized for being esoteric and 
irrelevant [and] for not being a source of creative ideas in these times of socioeconomic 
crisis. .. . it is clear that given our current level of personnel and resources, governments 
expect social scientists to be seen to be making greater contributions to national, applied, 
interdisciplinary, problem-oriented research. 

In Britain, Nisbet and Broadfoot (1980) produced one of the most comprehen- 
sive treatises on this subject. They identified the failure of “the greatly increased 
investment in educational research and curriculum development” to meet “raised 
hopes and expectations” as a major source of concern in the UK over the research- 
policy gap. They also felt that “growing sophistication and self-awareness among 
educational researchers led to aspirations to hold an established place in the educa- 
tional system” which were apparently not realized. Evidence of continued interest 
in this area was provided by the recent meeting of a group of social scientists with 
MPs in the House of Commons to explore the possibility that they could play a 
greater part in British policy making (Wotjas, 1984). Levy and Healey (1984), how- 
ever, cautioned that a danger exists in overselling the contribution of social science 
to the management of change in the current technological revolution. In a related 
comment, Hamilton (1984) stated that greater coordination was needed if educa- 
tional research wanted to overcome its fragmented impact upon British educational 
policy. 

Similar views have been expressed in the United States. For example, in the 
1978-1979 Report of the US National Council on Educational Research, the Chair- 
man, John Corbally (1980, p. 1), stated that the brunt of concerns raised by mem- 
bers of Congress, their staff, and interested groups after the first year of existence 
of the National Institute of Education was, “You spent all of that money (around 
$140 million in the first ‘golden year’) and the schools still aren’t any better.” Also, 
Caplan (1982, p. 36), after conducting a study in which over 200 upper-level federal 
executives were carefully questioned, concluded that formal or informal contact be- 
tween social scientists and senior executives was rare. In order to understand more 
clearly the impact of social science research findings on policy and so focus atten- 
tion upon the formulation of educational policies by US state legislatures, the Na- 
tional Institute of Education funded a study by Mitchell (1981) who collected data 
from 160 “key legislative policy influentials” in Arizona, California, and Oregon. 
Mitchell’s (1981, p. 299) major conclusion was that “scientific expertise makes im- 
portant contributions to legislative policy only if policymakers are oriented toward 
a reliance on expert authority and authentic scientific expertise is available to 
them.” In connection with this type of activity, Holden (1981, p. 1398) observed 
that a lot of people are making lists to demonstrate all the ways that social and be- 
havioral research has contributed to society. 

The same types of concerns have been expressed in New Zealand. For example, 
Geraldine McDonald, in her 1980 NZARE Presidential Address, remarked that: 

Nevertheless there appears to be a strong urge both in New Zealand and in countries 
other than our own to try to bring research and policy into a closer relationship, to bring 
relevant research to the attention of those who make decisions, to understand the prob- 
lems of decision makers, and to analyze the processes by which policy is determined. 

Since 1977, the Australian Council for Educational Research (ACER) (1980, 
p. 30) has undertaken a research program “specifically designed to inform social 
policy decisions.” This program consists of three groups of studies: (a) evaluation of 
selected policies and programs; (b) transition from school to adult life; and 
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(c) allocation of staff and resources to schools. ACER feels that such research con- 
tributes evidence to assist decision making as well as analyzing and understanding 
the issues being considered. Also, Dunn (1981) cited the opinion of the former 
Chairman of the Australian Schools Commission that research has had little impact 
upon the work of the Commission should have had more. Further, some valuable 
insights into the importance in policy making of politicians and state directors-gen- 
eral of education in Australia have been provided in the eight monographs of the 
University of Melbourne project entitled The Education Policy Process at State 
Level: An Australian-United States Comparative Study (Project Directors, 
G.S. Harman & F.M. Wirt). These monographs include many examples of policies 
which have been adopted by premiers, governments, and directors-general, follow- 
ing differing amounts of public involvement. 


Explanations 


Before presenting the reasons why research commonly has had less impact upon 
policy than is expected, three major points must be emphasized. First, even the 
most cynical must accept that some research activities have had an obvious and di- 
rect impact upon the formation of policies. The National Academy of Sciences 1977 
Report assessed that systematic, disciplined inquiry has helped in the transforma- 
tion of education over the century, citing examples such as greater access to educa- 
tion and the development of more humane attitudes. That report also stated that 
people throughout the world see and talk about reality differently as a result of 
what Freud, Dewey, Thorndike, Skinner, Piaget, and Mead have written and said 
(Kiesler & Turner, 1977, pp. 11-12). Research findings have also been influential in 
(a) the comprehensivization of Swedish secondary schools (Husén, 1978, p. 545); 
(b) positive discrimination by use of additional funding for “deprived areas” identi- 
fied in the Plowden Report (Bulmer, 1982, p. 41); and (c) the decision to delete 
French from British primary schools (personal communication Yates, 1982). 
Second, we must recognize that some educational policies, such as the provision of 
“a balance of social, physical, emotional, and moral development” for students 
(Education Department of South Australia, 1981), have little obvious research base. 
Third, the relationship between research and policy must be viewed as more com- 
plex than was originally supposed (Foster, 1979, p. 93). The complexity of the 
research-policy relationship was referred to by Caplan (1982, pp. 32-33) in this way: 

As a result of recent empirically-based studies, it has become evident that even with the 
increased production of policy-relevant knowledge and improved technological proce- 
dures for transfer and dissemination, the frequency of use and impact of knowledge has 
not increased substantively. Simply because information is timely, relevant, objective, 
and given to the right people in usable form, its use has not been guaranteed. . . . Obvi- 
ously, utilization is not a simple process. 
This is compounded by the complexity of the processes involved in gaining knowl- 
edge and forming policy. Weiss (1982) coined the terms “knowledge creep” and “de- 
cision accretion” to indicate that obtaining increases in knowledge and arriving at 
decisions are both slow, gradual processes. Warren Piper (1980) considered that 
governments may be well informed by social science, but are weak in producing 
workable solutions. Mitchell (1981, p. 271) added further insight into the complex- 
ity issue by presenting these conclusions based on six empirical survey studies of 
social science utilization: (a) the concept of “social science” is complicated and diffi- 
cult to interpret; and (b) “utilization” has several different dimensions as variations 
occur in the frequency and the extent of use of the findings. 
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Many reasons have been advanced in the literature to explain why educational 
research activities have not obviously affected educational policy in a substantial 
and direct manner. A comprehensive categorization of these reasons has been pre- 
sented by Holdaway (1982). Many relate to differences in orientation, knowledge, 
and time-frames of policy makers and researchers. Henkel and Kogan (1981, p. 39), 
Nuthall (1975, pp. 4-5), Caplan (1982, p. 33), and Davey (1982, p. 1) have all 
affirmed that policy makers prefer to rely on their own judgment. In Caplan’s 
words (1982, p. 33): 


Bureaucratic, ethical, attitudinal, and social considerations take precedence over the val- 
ue of information in its own right, and that is probably the way it should be. After all, so- 
cial problem-solving is a political process and it is neither desired nor expected that it 
should be otherwise. 
Crossland (Boyle, Crossland, & Kogan, 1971, p. 190) lent support to Caplan’s con- 
clusion in describing Britain’s change to comprehensive schools as being influenced 
by basic value judgments about equity, equal opportunity, and social division, as 
well as views about education. Another aspect was noted by Mitchell (1981, p. 299) 
who found that “scientific expertise makes important contributions to legislative 
policy only if policymakers are oriented toward a reliance on expert authority and 
authentic scientific expertise is made available to them.” 


Also identified as a reason for less than expected impact are the reports written 
by social scientists. For example, Crelinsten (1984, p. 2) considered that, in at- 
tempting to be scientific, social scientists have given their research a bad name by 
writing obscure, pedantic, and jargon-filled articles. Davey (1982, p. 1) concluded 
that politicians commonly view research as biased, outdated, long-winded, aca- 
demic, and impractical. Researchers are also criticized because they are used to 
making different explanations, which is viewed as hedging by laymen (Paulston, 
1982, p. 9), because they fail to deliver the findings relevant to crucial decisions 
(Baumgart & Low, 1979, p. 77), and because their studies lead to conflicting results 
(Averch, Carroll, Donaldson, Klesling, & Pincus, 1972, p. 2). Steinle (1981, p. 606) 
added another significant concern—administrators do not have the time to read all 
of the relevant research. 

Further, as Wirt and Mitchell (1982, p. 2) concluded, knowledge will be utilized 
only if it enhances political influence, simplifies policy problems, or assists in iden- 
tifying effective solutions. Stuart Smith (1984), President of the Science Council of 
Canada, considered that social scientists have not been sufficiently aggressive in 
demonstrating applications of their findings to policy. Another major reason relates 
to “time”: Whereas politicians work within a very short time frame, researchers can 
rarely conduct the necessary studies in the time available. Tizard (personal com- 
munication, 1982) observed that politicians who set up a project are often gone by 
the time the project is completed. Levy (personal communication, 1982) has also 
noted that many disciplines, such as economic history, are always closely related to 
policy, whereas others such as archaeology are not. 

Another explanatory thrust has been provided by Lindblom and Cohen (1979, 
pp. 10-29) in their discussion of “ordinary knowledge” and of social learning and in- 
teractions as complementary alternatives to professional social inquiry (PSI). In 
their view, social scientists underestimate society’s use of “new ordinary knowledge” 
which comes from “common sense, causal empiricism, or thoughtful speculation 
and analysis” (p. 12). PSI is seen to add little real new knowledge and much of it is 
ordinary knowledge, such as “television absorbs much of children’s time.” Most PSI 
practitioners “rely heavily on the same ordinary techniques of speculation, 
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definition, conceptualization, hypothesis formulation, and verification as are 
practiced by persons who are not social scientists or professional investigators of 
any kind” (p. 15). 

These views of Lindblom and Cohen about the sources of “ordinary knowledge” 
were not fully supported by Weiss (1982, p. 623) and Rule (1982, p. 4) who argued 
that, because social science knowledge and concepts become assimilated with 
everyday knowledge and language, policy makers commonly are unaware that they 
are using research findings. Prewitt (1981) provided examples of social science con- 
cepts common to our vocabulary including human capital, identity crisis, span of 
control, price elasticity, and acculturation. 


Lindblom and Cohen (1979) also claimed that policy formation can be heavily 
influenced by social learning; for example, an energy policy may be more accept- 
able after people have waited in line for gasoline. They also concluded that “we do 
not well understand that interaction is a method of problem solving” (p. 20). 
Interactive problem solving was defined by them as “to undertake or stimulate 
action—usually interaction—so that the preferred outcome comes about without 
anyone’s having analyzed the given problem or having achieved an analyzed 
solution to it” (p. 20). They classify voting, including legislative voting, as interac- 
tion. 


Obviously, many explanations have been proposed for the perceived lack of im- 
pact. Probably all of these explanations have some validity in the case of individual 
policies. The caution of Lindblom and Cohen (1979) concerning the marginal 
growth of knowledge through professional social inquiry, is important, even if we 
can identify some research results, such as Herzberg’s well known two-factor theory 
of motivation, which were not part of ordinary knowledge. However, a need still ex- 
ists for research findings to assist policy making. As Inger Marklund (1981, p. 14) 
has stated: “Perhaps the greatest paradox of all is that, at the same time as more 
and more is being banked on R & D at the political level, confidence in the prob- 
lem solving capacity of R & D is declining.” The existence of a “chicken and egg” 
situation in this regard was noted by Nisbet and Broadfoot (1980, p. 45): Research 
lacks impact because it is underdeveloped, but its lack of direct use contributes to 
continuing underdevelopment. 


Types of Impact 
Further explanations can be derived from a consideration of the three different 

types of impact identified by Nisbet and Broadfoot (1980, pp. 20-21): 

1. Immediate, direct impact. This is perceived to be rare except when topics are 
noncontroversial and values are not challenged. Taylor (1981, p. 190) consid- 
ered that research which has the strongest short-term impact may not have the 
greatest long-term influence, but its publicity could generate further studies 
which advance knowledge and improve the rationality of decision making. 

2. Intermediate impact. This occurs with evaluation and developmental work, 
which is seldom published, and which obtains its original impetus from re- 
search. 

3. Long-term, indirect impact. Nisbet and Broadfoot saw this impact as that 
which “influences the climate of opinion” and concluded that it “is increasingly 
recognized in the literature as the most important and far-reaching.” Authors 
who supported this view include Angus (1981, p. 575), Getzels (1978, 
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pp. 487-488), Nuthall (1975, pp. 4-5), Powell (1980, p. 2), Power (1981, p. 14), 

Wagenaar, Kallen, and Kosse (1982, p. 8), and Weiss (1982, p. 621). They per- 

ceived that basic research is more influential in the long run than is research 

focused upon a particular practical problem because it has the capacity to pro- 
vide different conceptualizations, to reformulate problems, to furnish new in- 
sights, to sharpen perceptions, and to stimulate discussion and questioning. 

Adams, Smelser, and Treiman (1982, p. 102) drew this conclusion: 

The power of basic research to improve and enrich our lives grows out of the mutual rein- 
forcement and synergism of many interlocking ideas, findings, and practical outcomes. It 
cannot be understood and properly utilized if we concentrate instead on isolated, 
product-centered outcomes. 

Another indirect aspect was raised by Breton (1977, p. 45) who reasoned that pol- 

icy research will have an impact upon policy makers if it affects their cognitive sys- 

tem, that is, the system of ideas, postulates, and perceptions of reality. Further, 

Beyer and Trice (1982, p. 605) contended that the results of applied research are 

“often not generalizable to other situations.” 

Bulmer (1982) has provided two models which place many of the foregoing ex- 
planations in perspective. The engineering model, which is called the physical sci- 
ence model by some authors, presumes that scientific findings will lead to desired 
end-states agreed upon by both researchers and decision-makers, whereas the en- 
lightenment model presumes that one purpose of research is to provide back- 
ground information which can alter the conceptions of policy makers. This latter 
model of indirect impact obviously is favored by most writers who have examined 
this field. Another useful categorization was provided by the National Academy of 
Sciences 1977 report entitled Fundamental Research and the Process of Education 
(Kiesler & Turner, 1977, pp. 8-11) which proposed these four spheres of influence 
of research: 

1. A view of reality—research influences how people view the world, the lan- 
guages that they use, and the values they attach to their observations; 

2. A vision of the achievable—research influences our expectations and 
understandings, for example, what can be achieved by handicapped children 
and what limitations are imposed on underprivileged groups; 

3. Know-how—research may lend support to our current practices, and we should 
not always consider that research should lead to improvement; and 

4. A commitment to act—research can provide data to support the view that an 
activity (e.g., teaching) is worthwhile, as well as assisting in improvement, 
altering conceptions, and deepening work commitment. 

A different set of models of the process by which research enters social 
problem-solving was presented by Weiss (1977): 

1. Decision-driven model—research responds to a problem identified by decision 
makers; 

2. Knowledge-driven model—research presents an opportunity that can be capi- 
talized upon by decision makers; and 

3. Interactive model—research is part of a complex search for knowledge from a 


variety of sources. 
This third model is the one favored by Lindblom and Cohen (1979) as described 
earlier. Weiss’s first model relates to the work of commissions or committees of in- 
quiry, which many authors see as having influenced policy far more than have indi- 
vidual research projects. To a large extent this phenomenon has resulted because 
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governments established commissions to investigate and report upon areas in which 
governments already have decided that policy changes are desirable. Bulmer (1982, 
p. 99) claimed that “Commissions may also act as a device to enable governments 
to do what they want to do anyway, sometimes clothing it in the legitimacy pro- 
vided by research.” 

After reviewing the impact of Canadian commissions investigating postsecond- 
ary education (PSE) from 1965-1980, Ahamad (1980) concluded that some were 
more influential than others primarily because of timing, membership, presenta- 
tion, and marketing of the report and implementation plans. Further, Sheffield 
(1982, p. 51) observed that 

Even though some provincial inquiries have not had their recommendations accepted, it 
must be said that such projects make use of existing research results and create more. As 
a consequence, the stock of knowledge about PSE is increased and enriches the resources 
available to future researchers and decision-makers. 
James Coleman (1984, p. 132) has made an interesting, related observation as a re- 
sult of experiences with his US Equal Education Opportunity Report: Some social 
policy research is used in unintended ways. For example, research associated with 
his report was never used in formation of civil rights education policy, but it was 
considered in resolution of conflicts arising from local school desegregation. 


Lord Rothschild (1982, pp. 18-21) supported the need for research when com- 
missions are considering policy options. In his review of the British Social Science 
Research Council, he observed that failures in public policy and planning since 
1945, such as the building of high-rise public housing apartment blocks, are “in 
part attributable to the lack of adequate enquiry into social facts.” He felt that 
slum clearance and the building of new towns required “organized enquiries of the 
kind that social scientists are trained to undertake,” and concluded that 

there are, and will continue to be policy issues to which empirical research, requiring high 
academic standards in the enquiry, are directly relevant. Such research is, at the very 
least, a check on the credibility of entrenched common sense beliefs, which may not corre- 
spond with the facts. 

This view calls into question the appropriateness for policy making of “ordinary 
knowledge” as described by Lindblom and Cohen. Of course, the concern that so- 
cial scientists feel about the impact that their work has or could have, as in re- 
search on the areas identified by Lord Rothschild, would depend upon their 
perceptions of their role. In this regard, Dunn (1981, pp. 519-525) provided a useful 
distinction between “technocratic guidance” and “technocratic counsel.” The tech- 
nocratic guidance perspective emphasizes a power shift to policy analysts and away 
from policy makers, generated by “changes in the structure of government and the 
nature and scope of social problems” and the view that “policy-relevant knowledge 
is an increasingly scarce resource.” The technocratic counsel perspective empha- 
sizes that an increase in policy-analysis activities enhances the power of dominant 
groups through legitimization of their policy decisions. Dunn concluded that nei- 
ther perspective is accurate, as the technocratic guidance approach exaggerates the 
power and influence of analysts while the technocratic counsel approach exagger- 
ates the symbolic importance of legitimization. However, these two perspectives do 
produce ideal types which aid analysis and understanding. 
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Suggestions 


Those who have examined the impact of research upon policy have made many 
suggestions for increasing this impact. These suggestions can be categorized under 
the headings of improving research and improving dissemination and interaction. 


Improving Research 


Suggestions for improving research include the use of more thematic research 
which is focused upon particular topics: This would allow greater concentration of 
research and the obtaining of more information relevant to an area of policy inter- 
est. This approach has been employed in recent years by the Social Sciences and 
Humanities Research Council of Canada, although it has been criticized by the ad- 
vocates of independent, scholar-generated research. Others recommend more 
triangulation—the use of different methods simultaneously or successively, in an 
attempt to obtain greater reliability and thereby more faith in the results. Further 
suggestions include narrower delineation of research problems, more frequent use 
of theoretical frameworks, more emphasis on illuminating case studies, and even 
greater concentration on basic research which many see as having the most long- 
term utility. 

Another set of suggestions relates to making research more directly “policy-rele- 
vant.” Coleman and LaRocque (1983, pp. 247-248) have discussed this matter as 
follows: 

In order for research findings to be utilized . . . research should be addressed specifically 
to policy-makers, should investigate problems which are of concern to them, and should 
be designed to provide data and interpretations which have predictive utility concerning 
the probable consequences of policy options. Furthermore PRR [policy-relevant research] 
should concern itself with those policy options which are important to policy-makers, be- 
cause they are normatively acceptable and have some face validity as solutions to the per- 
ceived problems. . . . Additionally, policy-relevant research should be concerned with 
utility at every stage of the research, rather than addressing this as a postscript on policy 
implications, and hence should: 

a. focus on variables manipulable by policy makers... and 

b. be sensitive to the political environment and to uses likely to be made of the re- 

search findings. 


McDonald (1976, pp. 1-2) also advocated that consideration of policy alternatives 
should pervade the work of educational researchers and that “the policy alterna- 
tives available to the policy makers should be carefully laid out as a first step in 


planning research.” 


Improving Dissemination and Interaction 
Suggestions about communication include the development of mutual confi- 
dence by researchers and policy makers, greater collaboration and cooperation, 
sharing of orientations, better understanding of the research-policy relationship (as 
in Mitchell’s work), improved dissemination techniques, and greater monitoring of 
research findings having potential relevance to areas of policy interest. Dube (1982, 
p. 7) has succinctly addressed several of these matters: 
It is necessary to establish better rapport and communication between the social scien- 


tists and the policy makers. Often the technical jargon of the social scientists puts off the 
practical minded administrators and policy makers. The social sciences findings have to 
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be communicated to them in a different idiom. And bridges have to be built between the 
two so that their uneasy relations are replaced by more harmonious and interpenetrating 
work relationships. 
However, Weiss (1982, p. 637) has cautioned that “adherence to all the traditional 
strictures,” including “jargon-free communication,” has only a marginal effect upon 
the actual application of results. 

Further, Mitchell (1981, p. 153) recommended that universities should develop 
training programs for legislative staff to improve their capabilities in location and 
interpretation of policy-relevant social science findings. At the same time, he 
cautioned that “interpretation of social science is every bit as complex and profes- 
sional a task as is its creation.” Nisbet and Broadfoot (1980, p. 12) emphasized that 
change is not effected merely as a result of telling educators about research find- 
ings. Another dimension was highlighted by Lindblom and Cohen (1979, pp. 2-3): 
The traditional parochial views about inadequate utilization (e.g., unsatisfactory 
personal relations between researcher and client) ignore the wider issue that man is 
a social problem-solver who does not rely to a great extent on research findings. 


Analysis 

The wide variety of views on many aspects of the impact of research upon social 
policy, which Lindblom and Cohen (1979, p. 1) rightly called “a river of mixed diag- 
nosis and prescription,” require some analysis and synthesis. Such an activity nec- 
essarily involves subjective assessment and selection based partly upon personal 
experience, preference, and force of argument. A summary of the major points ad- 
vanced earlier in the article is first presented. Second, attention is given to aspects 
which were missed or unemphasized. Third, some areas of disagreement are re- 
ported. Fourth, some suggestions are presented for researchers and those interested 
in the topic of research impact. 


Summary 
The major points or lines of argument developed thus far are as follows: 

1. Many writers have paid attention, over the last two decades, to the apparent 
lack of impact of research findings in social science, including education, upon 
policy formation; 

2. This interest is persisting on an international basis; 

3. The perceived lack of impact is of concern to both researchers and policy 
makers; 

4. Policy usually results from social, economic, and political forces and from per- 
sonal sentiment rather than from research findings; 

5. Some obvious cases of direct impact have been identified, especially when the 
research has been related to commissions of inquiry; 

6. Many reasons and suggestions have been advanced for explaining and reducing 
the lack of direct impact; 

7. The greatest contribution of research is through indirect influence by clarifying 
issues, providing data, sensitizing policy makers, and providing conceptual ana- 
lyses; 

8. At the same time that the contribution of social science research towards pol- 
icy problems and the solution of social problems is being questioned, policy 
makers in a wide variety of arenas are requesting that more research be 
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conducted to assist them; 


9. The relationship between research and policy is complex, resulting in part from 
different types of research, different types of policy, and different types of ap- 
plications of research findings; and 


10. Much of the knowledge used by policy-makers comes from sources other than 
social research. 


Underemphasis 


Some aspects of the phenomenon appear to have been either omitted or some- 
what underemphasized in the cited literature. First, the term “relationship” is prob- 
ably better than “contribution” or “impact” to emphasize that the research-policy 
linkage is interactive rather than unidirectional. To illustrate, Houston (personal 
communication, 1982) has pointed out that policy and policy-making have had a 
profound effect upon the research enterprise in many western countries. In particu- 
lar, he cited the work of Federal Government commissions in Australia in initiating 
a great deal of research activity. Second, few writers have considered the existence 
of policies at different levels: in education, for example, we can have policies at the 
federal, state/provincial, district, and school levels. Mitchell’s study of three US 
legislatures, although highly important, focused only on the impact of research on 
state policies. Similar research on impact at other political and administrative 
levels is warranted. Third, the epistemological concerns raised by Lindblom and 
Cohen, especially with respect to the relationship of ordinary knowledge to social 
science knowledge, have not been addressed by most authors. Although Lindblom 
and Cohen may have somewhat overstated the minimal contribution of our re- 
search to “new” knowledge, their view deserves serious consideration by all educa- 
tional researchers. To what extent are we producing “new” knowledge as compared 
with knowledge which is already in the public domain? Fourth, the difficulty of 
ascribing causality to actions has received little recognition, except through discus- 
sion of the major indirect effects. Even identifying the influence of relevant re- 
search findings upon a particular policy decision is a very difficult exercise. A fifth 
point, made by only a few writers, relates to the distinction between scholar-gener- 
ated research and “in-house” research, such as university/college institutional re- 
search or directed/contract research performed for a commission or committee of 
inquiry or government agency. The findings of such research, clearly targeted upon 
a particular problem area, are far more likely to have substantial impact. Hence, 
those reflecting about the research-impact relationship should clearly distinguish 
between these two major types of research activity as they have different purposes 
and consequently different associated expectations. Those analyzing in-house re- 
search activity, which may require researchers to act as policy advisors, should take 
cognizance of Dunn’s polar extremes of technocratic counsel and technocratic guid- 
ance. 

Sixth, although improved interaction between researchers and policy makers 
(both elected and appointed officials) is frequently encouraged, insufficient empha- 
sis has been placed upon analyzing the types of interaction leading to policy that 
Lindblom and Cohen have discussed. Their view is supported by Warren Piper 
(1980) who argued that we need to develop models of human interaction involving 
the application of occupational and social psychology to educational management. 
The seventh point to be made in this section relates to policy studies. Laswell’s dis- 
tinction between studies of policy and studies for policy was deemed useful by 
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Downey (1984, p. 1) who saw the first type as relatively unproblematic and the sec- 
ond type as somewhat controversial. This again highlights the importance of identi- 
fying different types of research when generalizations about impact are being 
made. Downey (1984, p. 1) also proposed that the disagreements concerning the use 
of research, different sources of knowledge, and the roles of analysis and interac- 
tion in policy development (Lindblom and Cohen, 1979) “ought to become the 
bases of continuing dialogue between those who do policy research and those who 
shape policies for education.” Lastly, only a few writers refer directly to the exist- 
ence of different types of impact, such as the classification presented by the Na- 
tional Academy of Sciences (Kiesler & Turner, 1977). 


Disagreements 


Probably the major disagreement observed in the literature relates to the issue 
of whether impact can be increased, and if so, by what means. Many advocate that 
the impact should be increased and that this can be achieved by improving existing 
or using new procedures. However, others consider that we should not endeavor to 
increase direct impact, because the greatest impact is indirect, occurring through a 
variety of measures which alter perceptions. 


Suggestions 


The study of the relationship between research and policy obviously is wide- 
ranging, complex, and important, and warrants our continuing attention. My as- 
sessment is that virtually all of the concerns, explanations, and suggestions voiced 
by others have some validity. Even if Lindblom and Cohen’s view of social knowl- 
edge generation is accepted, there surely is room for more interaction between re- 
searchers and policy makers, especially with respect to policy-relevant research. No 
one view should prevail, to the exclusion of others, in the debate about how to 
make research matter. 


Moreover, educational researchers should neither confuse nor combine basic re- 
search and policy-oriented research. Basic research can have a long-term effect on 
policy (and practice), but immediate, direct linkage should not be expected. A great 
deal of educational research conducted by university staff and students has the pri- 
mary purpose of increasing our knowledge and understanding. To expect that such 
research will necessarily have policy implications does a disservice to the research 
and policy communities as well as to the activity of educational research. The 
suggestions made by Coleman and LaRocque (1983) are pertinent here. All too fre- 
quently both academic staff and graduate students in education conduct theoreti- 
cal research, whose aim is to advance knowledge and understanding, but in the 
concluding sections feel obliged to discuss the implications of their findings for pol- 
icy without having paid any reference to such implications throughout the design, 
data collection, or data analysis phases. Accordingly, the relationship between re- 
search and policy should receive more attention in graduate courses on research 
methodology. 


A fertile area for further examination is the means by which social science find- 
ings relate to our knowledge, through assimilation which changes vocabulary and 
attitudes, through finding empirical support for “ordinary knowledge,” or occasion- 
ally through the relatively direct impact of policy-oriented research. In view of the 
implications of policies for practice, and the various concerns described in this 


article, such continuing examination of the research-policy relationship is easily 
justified. 
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THE EDITOR’S PAGE 


Put Up Time 


A key recommendation in Canadian Research on Education: A State of the Art 
Review sponsored by the Canadian Society for the Study of Education in 1981 was 
that the Social Sciences and Humanities Research Council organize the establish- 
ment within education of priority areas for research of national interest which 
would serve as the focus for programmatic research. The editors of the report, John 
Andrews and Todd Rogers, argued that such focusing of research will lead to in- 
creased productivity and quality, aid in the establishment of a research tradition, 
and result in an improved capacity for research in education. 

SSHRC has now responded favorably to this recommendation by including re- 
search related to education and work in a changing society in its strategic grants 
program. Within this broad theme, projects may deal with skill development, work 
habits and attitudes; the goals and roles of those affected and involved; institu- 
tional and systemic responsiveness to change; and/or the influence of an increas- 
ingly competitive international environment. Thus the interests of most academics 
can likely be accommodated. 

Three kinds of grants are now available for the first time: (a) seed money grants 
of up to $5,000 for the development of proposals; (b) research workshop grants of 
up to $15,000 for enhancing knowledge and skill in problem definition, design, and 
methodology; and (c) strategic research grants of up to $10,000 for one year, or 
$250,000 over three years, for relevant projects, including pilot or feasibility 
studies. For 1987 awards, the deadline for receipt of application is April 1 with re- 
sults to be announced in September 1987. 

The good news is that these grants are available. The bad news is that they put 
at risk the reputation of the scholarly community in Canadian education. For the 
number of proposals submitted will attest to the authenticity of our interest in the 
advancement of knowledge; and the nature of the proposals will demonstrate the 
quality of our intellectual inquiry. Our collective response will be a public test of 
the sincerity and strength of our commitment to the view that the functions of edu- 
cation in our society are so vital to the well-being of our country as to require that 
research directed toward its improvement be undertaken. 

It is now put up time. 

We must put up. We must put up or be branded as academic charlatans. And 
we must put up or contribute to the triumph of ignorance through the continued 
application of yesterday’s solutions to tomorrow’s problems. 

We must put up or have only ourselves to blame. 


WHW 
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University of Alberta 


JAMES C. SARROS 


Victorian Ministry of Education, Australia 


and 


NICHOLAS AIDOO-TAYLOR 


Technical Teachers’ College, Kurnase, Ghana 


Organizational Stress and Coping: 
A Model and Empirical Check 


A conceptual model tracing the relationships among organizational stress sources, 
stress effects, and coping strategies used by personnel is presented, followed by a 
study of organizational stress among district-based administrative and supervisory 
personnel. The study revealed that work load and interpersonal relationships were 
major sources of stress. The frequency of the stresses encountered decreased as or- 
ganizational level of responsibility increased, although the intensity of stress for each 
stress source increased. The findings suggest that experiences of stress are role 
specific. That is, district-based administrators experienced more role conflict and role 
ambiguity than other educational personnel but registered the lowest levels of stress 
for other work stressors. Major coping strategies used by these administrators were 
support from immediate family and support from work colleagues. 


Stress and how to cope with it in organizations are frequent topics of discourse, re- 
search, and writing both within the fields of education and in organizations gener- 
ally. However, Schuler (1984) asserted that “there is still a great deal not known 
about stress or coping with stress in organizations” (p. 1). Even less is known about 
the causes and consequences of work stress among educational administrators. Fur- 
ther, studies are inconclusive in their findings (e.g., Farkas, 1984; Jankovic, 1983; 
Koff, Laffey, Olson, & Cichon, 1980; Meléndez & de Guzman, 1983; Swent & 
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Gmelch, 1977; Welch, Medeiros, & Tate, 1982). This report is a response to the 
continuing concern about the sources and effects of stress and coping with stress in 
an educational context. 


Purpose 
The purpose of the article is fourfold: 


1. To present a conceptual model which would be useful in tracing the relation- 
ships among the multitude of sources of stress, effects of stress, and coping 
strategies in an educational organization; 


2. To report the findings of a study of organizational stress among district-based 
educational administrative and supervisory personnel in a large Western Cana- 
dian urban school district; 


3. To identify the specific effects of stress on the individual and the organization; 


To outline the coping strategies used by each subgroup of district-based per- 
sonnel. 


Conceptual Design 

The study was delimited to an examination of distress, defined by Selye (1974, 
p. 5) as a negative form of stress which arises from unpleasant experiences. This 
form of stress, unlike the positive eustress, is far from therapeutic and is almost al- 
ways destructive. Because of its deleterious effects on individuals and ultimately on 
the organizations in which these individuals work, it was felt that a more complete 
conceptualization than was heretofore available of the stress experienced by educa- 
tional personnel—and others—was needed. Such a conceptualization would reveal 
the relationships that exist among a variety of variables associated with stress in an 
organizational context and should contribute to an understanding of stress in or- 
ganizations. This should help identify information needed for decisions aimed at 
enhancing the job design and work roles of educators. In addition, it was felt that 
the conceptualization would assist the researcher and practitioner to address the 
suggestion by Organ (1978) that “we need to know much more about the organiza- 
tional sources of stress, the conditions under which stress is more or less tolerable, 
and individual differences that moderate or mediate the effects of stress” (p. 305). 


Stress has been generally conceptualized as an individual psychological, physio- 
logical, and behavioral response to a situation that is perceived as presenting a de- 
mand which threatens to exceed the individual’s capacities for meeting that 
demand (e.g., Baum, Singer, & Baum, 1981; Hiebert, 1984; McGrath, 1976; 
McLean, 1979). This transactional definition of stress, according to Cox and 
Mackay (1981, p. 101), focuses on “the active and adaptive nature” of the 
transaction between the individual and the organization. From this perspective, 
transactional models of stress serve as appropriate models of organizational stress. 
The model of stress developed for the purposes of this article is a composite of the 
transactional stress models designed by Cooper and Marshall (1976, p. 11), 
Ivancevich and Matteson (1980, p. 44), and Kyriacou and Sutcliffe (1978, p. 3). 

As the model of organizational stress presented in Figure 1 reveals, within-or- 
ganizational sources of stress or stressors may be physical, individual, group, or 
organization-wide in origin. In this study, eight factors of work stress identified in a 
factor analysis of 50 stress items and two dimensions of role stress—role ambiguity 
and role conflict—were found to be the major intraorganizational stressors. Among 
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these intraorganizational stressors, the physical stressors were identified by an in- 
adequate physical resources factor and a noise factor. Individual stressors were 
identified by the factors of lack of respect (by others, for one’s professional field) 
and job uncertainty. The group stressors were of two sorts: an unsatisfactory rela- 
tionships with colleagues factor and an unsatisfactory relationships with 
superordinates factor. Organization-based stressors included the role stress items 
making up the role conflict and role ambiguity scales, as well as the stress factors 
of work load and program constraints. 


Ce ee nnn ne nnn eel 


Stressors Individual Characteristics Stress 


Sex, age, personality, higher-order Effects on Responses of 
needs, beliefs-values-attitudes, 


tolerance for ambiguity 


Intraorgan- INDIVIDUAL 
izational 

e Physical Immediate Immediate 
e Individual 

¢ Group COPING Long-term Long-term 


STRATEGIES 


e Organizational 


e Individual 
e Group 
e Organizational 


APPRAISAL 


ORGANIZATION 


Extraorgan- 
izational 

¢ Family problems 
e Life crises 

° Financial 
concerns, etc. 


Immediate Immediate 


Long-term Long-term 


Figure 1: Model of organizational stress. 


This organizational model of stress sources and stress effects assumes that 
stress is the result of the individual’s appraisal of a potentially stressful work situa- 
tion. Coyne and Lazarus (1980) defined appraisal as “the person’s continually 
reevaluated judgments about demands and constraints in ongoing transactions with 
the environment and his or her resources and options for managing them” (p. 150). 
Appraisal is influenced by individual characteristics such as sex, age, the psycho- 
logical Type A and Type B behavior syndromes, and personal values and beliefs. 


Whenever organizational stressors are perceived as threatening to personal well- 
being, efforts are made to employ coping strategies. Baum, Singer, and Baum 
(1981) asserted that “the perception of danger motivates a search for coping 
responses that will reduce this threat” (pp. 9-10). The use of a coping strategy may 
moderate but not necessarily negate experiences of work stress. Stress may have 
both immediate and long-term consequences for the individual. The extent to 
which immediate stress responses to coping have long-term consequences may be a 
function of individual coping strategies employed and the organizational support 
provided (e.g., Shinn, 1982, p. 79). Some immediate effects of stress for the individ- 
ual are increased blood pressure and rapid heart rate, which cause momentary dis- 
tress (e.g., Selye, 1974, pp. 25-30). If stress persists, long-term consequences such as 
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job dissatisfaction and burnout may occur (e.g., Blase, 1986; Cherniss, 1980a, 
1980b; Maslach, 1976, 1977; Veninga & Spradley, 1981). 


Individual responses to work stress have been implicated in the etiology of work 
performance decrement, poor mental health, and psychosomatic diseases (e.g., 
Davidson & Cooper, 1981; Frese, 1985; Ivancevich, Matteson, & Preston, 1982; 
Kyriacou & Pratt, 1985). Individual stress responses have specific effects on organi- 
zations such as poor quality service, poor staff relationships, absenteeism, and fluc- 
tuations in work productivity. Some long-term organizational effects include 
negative competition among subsystems, poor communication networks, isolated 
administrative decisions, and deteriorating service (e.g., Carroll & White, 1982, 
p. 43). Long-term organizational consequences of these effects are employee 
turnover and transfer, formalization of rules and regulations, and new personnel 
policies intended to curtail further employee disenchantment and stress. Cherniss 
(1980b) asserted that burnout as a consequence of work stress “contributes to a de- 
cline of staff job satisfaction . . . [and] can affect turnover, absenteeism, and 
interpersonal conflict” (p. 32). The feedback loops in Figure 1 from individual and 
organizational stress responses indicate that the situation remains stressful, esca- 
lates in intensity and frequency, or is moderated depending on the quality and 
success of reappraisal by the individual. The experiences of work stress by district- 
based educational administrators reported in this study are described within the 
context of the conceptual model of organizational stress outlined above. 


Sample 

The primary focus of the empirical component of this paper is on the 134 district- 
based administrative and supervisory personnel who participated in a system-wide 
survey of 2,829 school-based and district-based educational personnel in a large 
Western Canadian urban school district undertaken by Ratsoy and Friesen (1985). 
Other groups involved in the survey were nonadministrative school-based person- 
nel (N=2,478), and principals and assistant principals (N=217). These responses 
represented a 67° return from all such personnel in the school district and a re- 
turn of 76% from the district-based personnel. 

The 134 district-based administrators and supervisors included 43 specialists 
such as speech clinicians and pychologists, 57 subject consultants, 17 supervisors 
responsible for coordinating the services of consultants, six directors responsible for 
coordinating the services of specialists and supervisors, and 11 assistant and associ- 
ate superintendents in charge of instructional services and school support services. 


Instrumentation 
The questionnaire used to collect the data was designed especially for the study 
and contained seven sections, (a) respondent demographic information including 
global occupational stress and personal life stress items, (b) 50 items on frequency 
and intensity of work-related stresses, completed by all educational personnel in 
the district, (c) 25 additional items for school-based staff, (d) 18 items relevant to 
principals and assistant principals, (e) role ambiguity and role conflict scales 
(Rizzo, House, & Lirtzman, 1970), (f) a work-related burnout scale (Maslach & 
Jackson, 198la, 1981b), and (g) open-ended and semiopen-ended questions on the 
nature and effects of work-related stresses affecting the respondent, on the long- 
term consequences of the work-related stresses experienced, on the support services 
available to the respondent, and on the degree of satisfaction with the district’s 


stress coping procedures and systems. 
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As indicated earlier, this paper reports findings for the eight stress factors pro- 
duced by the factor analysis of the 50 stress items completed by all educational 
personnel in the system-wide survey of the district. The eight stress factors identi- 
fied were: work load, unsatisfactory relationships with superordinates, unsatisfac- 
tory relationships with colleagues, lack of respect, job uncertainty, program 
constraints, inadequate physical resources, and noise. Also included are the find- 
ings for the two role stress variables of role conflict and role ambiguity. 


Respondents were asked to indicate how frequently they experienced work-re- 
lated stressors, and how stressful they were for them. Frequency of stress was re- 
corded on a 6-point scale ranging from “Never” to “Very Frequently.” Intensity of 
stress was recorded on a 6-point scale ranging from “Not Stressful” to “Extremely 
Stressful.” 


Analyses 

Data transformation performed on the fixed-response items included frequency 
and percentage frequency distributions of the responses to the stress items, factor 
analysis of the items using varimax rotation, and analyses of variance. Open-ended 
responses were subject to content analysis. The open-ended responses provided by 
the district-based personnel added further insights into sources and consequences 
of work-related stress. This procedure was in keeping with the views of Carnoy 
(1979) who recommended content analysis “when source material is used to com- 
plement some other kind of data during an enquiry into attitudes” (p. 64). 


Study Findings 

The results of the analyses of variance for the three major subgroups of respond- 
ents who participated in the system-wide survey are presented in Table 1. For each 
stress factor, the district-based personnel were on the average experiencing less 
stress than nonadministrative and administrative school-based personnel. For five 
of the eight stress factors the mean stress score was significantly lower (.05 level) 
than that of at least one of the other groups. In contrast, district-based administra- 
tors recorded highest means for role ambiguity and role conflict, and in both cases 
their mean was significantly higher than that of the school-based non- 
administrative personnel. 


Nonadministrative school-based personnel (primarily teachers, but also includ- 
ing department heads, librarians, and counselors) recorded significantly higher 
levels of stress than the district-based personnel (assistant and associate superin- 
tendents, directors, supervisors, subject consultants, speech clinicians, and psychol- 
ogists) for five of the eight stress factors. Work load was the key stressor for all 
three groups, with no significant differences among them on this variable. This 
finding is consistent with Blase and Pajak’s research reported elsewhere in this 
journal. Their study of 80 secondary school teachers found that excessive work 
demands were major contributors to stress and had deleterious effects on teachers’ 
personal lives. The finding that nonadministrative school-based members of staff 
were experiencing significantly more stress than administrative personnel supports 
research which indicates increasing work-related pressures and tensions among 
teachers are related to their experiencing job dissatisfaction, stress, and burnout 
(e.g., Anderson & Iwanicki, 1984; Blase, 1982; Cedoline, 1982; Chapman, 1983; 
Friesen & Williams, 1985; Kalker, 1984; Kyriacou & Pratt, 1985; Litt & Turk, 1985; 
Schwab & Iwanicki, 1982b). 
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TABLE 1 


One-Way Analyses of Variance for Stress Scores of 
District-Based and School-Based Personnel 


AS A EE ae a eB ea et SE 
Computed Means for Each 
Item* 


A B C Overall 
Mean Significantly 


Stress Variable N= (134) (217) (2478) (2829) Different 
Groups 
1. Work load 3.1b 8:1 3.28 3.2 None 
2. Unsatisfactory 2.6> 2.7 2.88 2.8 None 
relationships 
with superordinates 
3. Unsatisfactory 2.5) p78 Pat) 2.6 None 
relationships 
with colleagues 
4. Lack of respect 2.40 2.9 2.98 2.9 A-B, A-C 
5. Job uncertainty 2.9b 2.6 2.88 2.7 A-B, A-C, 
B-C 
6. Program constraints 2.4) Be 2:9 2.9 A-B, A-C 
7. Inadequate 2,1> 23 2.64 2eD A-C, B-C 
physical resources 
8. Noise 2.4 2.7 2.98 2.8 ALCIB-C 
9. Role ambiguity** 2.88 2.6> 2.6 2.6 A-C 
10. Role conflict** 3.48 2.60 2.9 3.0 AC BC 


A=District-based administrators; B=Principals and assistant principals; C=Nonadministrative school- 
based personnel. 


*Original response categories for degree of stress were the following: 1=Not stressful; 2=Mildly stressful; 
3= Moderately stressful; 4= Considerably stressful; 5= Very Stressful; 6= Extremely stressful. 


**Original response categories for role stress ranged from 1=Definitely NOT TRUE of my job to 
7=Extremely TRUE of my job. 


4Indicates highest stress level for this factor or role stress variable. 


bindicates lowest stress level for this factor or role stress variable. 


Principals and assistant principals reported the highest level of stress on the 
program constraints variable for which their scores, along with those of their 
school-based nonadministrative colleagues, were statistically higher than those of 
the district-based personnel. Feeling powerless to alter prescribed programs may 
have contributed to feelings of stress among these school-based personnel. 
Principals and assistant principals had significantly higher stress scores for 
inadequate physical resources than the other two groups. In this regard, Welch 
et al. (1982) mention that principals are experiencing stress and burnout in record 
numbers because they are “faced with the problem of doing more with less .. . 
individual autonomy is declining” (p. 44). The findings of the present study tend to 
support the last assertion; however, the claim that principals are experiencing 
burnout in “record numbers” seems too strong. For instance, the relatively low 
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levels of role stress—both conflict and ambiguity—recorded by principals and 
assistant principals challenge established theory and research which suggest the 
boundary-spanning role of administrators encourages feelings of role conflict and 
role ambiguity (e.g., Caldwell & Doremus, 1978; French & Caplan, 1973; Gmelch, 
1977; Miles, 1975, 1976a, 1976b; Miles & Perreault, 1976). Instead, the findings 
from the present study support the claim of Gorton (1982) that “most high school 
principals are coping rather well with the difficulties of their job and are not 
experiencing a high level of stress” (p. 199). 

Generally, the findings of the present study suggest that levels of organizational 
stress decrease as organizational level of responsibility increases. For instance, 
district-based administrators recorded the lowest mean scores for each of the eight 
stress factors. In contrast, their higher levels of role stress indicate that 
experiencing stress may be role specific. The “executive” role and associated 
activities of district-based personnel may leave them susceptible to differing 
expectations and at times to insufficient information, encouraging feelings of role 
conflict and ambiguity. 


Within Group Comparisons 

The results of analyses of variance for the five subgroups of district-based person- 
nel are presented in Table 2. As has already been indicated, the major stressor for 
district-based personnel and for school-based personnel was work load, registering 
an overall mean of 3.1. The factor with the lowest mean for the district-based per- 
sonnel was inadequate physical resources. 


When subgroups were examined individually, each of the five subgroups had at 
least one mean stress score higher than the means for the other groups. For ex- 
ample, the assistant and associate superintendents (E) exhibited higher levels of 
stress for work load, lack of respect, and role ambiguity compared with the other 
subgroups; these senior level administrators also registered the lowest levels of 
stress on four factors; however, none of these means scores was significantly differ- 
ent from the means for the other four subgroups. 


The only difference in means to reach statistical significance was between 
specialists and consultants for the program constraints factor. Specialists exhibited 
significantly more stress than consultants on this stress factor. 


Overall, it is safe to generalize that differences among the five subgroups of 
district-based personnel, in terms of the levels of stress experienced, were minimal. 


Open-Ended Responses 
As indicated in Table 3, analysis of the open-ended responses revealed that work 
load was identified as the major stressor by 47% of respondents. Items identified 
as constituting work load included (a) an inadequate amount of time to plan and 
accomplish tasks satisfactorily, (b) too many meetings, (c) unrealistic deadlines, 
(d) too much paperwork, and (e) unreasonable expectations. Unsatisfactory rela- 
tionships with superordinates was mentioned by 22% of respondents. These 
district-based administrative personnel referred to their lack of participation in de- 
cision making and the role of feedback on performance as aspects of these relation- 
ships. 

Both role conflict and role ambiguity were identified by 10% or more of 
district-based personnel as sources of stress. Examples of the role-related stress 
sources included (a) playing the middleman between teachers and _ parents, 
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TABLE 2 


One-Way Analyses of Variance for Stress Scores of District-Based Personnel 


ee ee ee eee 
Computed Means* 


A B Cc D E Overall 
Mean Significantly 


Stress Variable N= (3) (Grn) (a) (1B) Gay) 1185) Different Groups 

1. Work load 31°31" 309% 352) 3h None 

2. Unsatisfactory relationships 2.6) 26 2.99 28 26 2.6 None 
with superordinates 

3. Unsatisfactory relationships 25 2.6 2.99 22 21> 95 None 
with colleagues 

4. Lack of respect 22 2.5 24 2.00 984 24 None 

5. Job uncertainty 295 89:38 S19 O10 1.8> Dep None 

6. Program constraints 2.82 22 23 296 Pry gs! A-B 

7. Inadequate 22 21 19 254 1.75 914 None 
physical resources 

8. Noise 2.64 22 2.6 125 18° 24 None 

9. Role ambiguity** 27 3.0 255 26 31 28 None 

10. Role conflict** LWW eee heey nacir None 


A=Specialists; B= Consultants; C=Supervisors; D= Directors; E= Assistant/Associate Superintendents. 


*Original response categories for degree of stress were the following: 1=Not stressful; 2= Mildly stressful, 
3= Moderately stressful; 4= Considerably stressful; 5= Very stressful; 6= Extremely stressful. 


**QOriginal response categories for role stress ranged from 1=Definitely NOT TRUE of my job to 
7=Extremely TRUE of my job. 


8Tndicates highest stress level for this factor or role stress variable. 


Pindicates lowest stress level for this factor or role stress variable. 


(b) dealing with novel situations, (c) dealing with conflicting responsibilities and 
expectations, (d) lack of long-range goals and expectations, and (e) lack of clarity 
about span of control. 

When subgroups were examined individually, and consistent with the 
quantitative data, work load was the major stressor for all subgroups. If frequency 
of response is any indication of severity of the stressor, assistant and associate 
superintendents (E) appeared more concerned with work load (72%) than any 
other subgroup. This finding complements the findings for the fixed-item 
responses. Also consistent with the data presented in Table 2, supervisors 
identified unsatisfactory relationships with superordinates as a major source of 
stress. In comparison, while directors recorded the second highest level of stress for 
this factor, they did not allude to it in their open-ended responses. 

Role ambiguity featured as a stressor for all subgroups, although role conflict 
was identified only by specialists, consultants, and supervisors. Szilagyi (1977, 
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TABLE 3 


Stress Sources Identified in the Qualitative Analysis 


A B C D E Total 

Stress Variable N=(%=)  48(32) 57(438) 17(12) 6(5) 11(8) 134(100) 

1. Work load 20(46) 26(45) 6(35) 3(50) 8(72) 63(47) 

2. Unsatisfactory relationships 7(16) 15(26) 5(29) == AXE) 29(22) 
with superordinates 

3. Unsatisfactory relationships — 2(3) 1(6) — — 3(2) 

with colleagues 

4. Lack of respect 1(2) = — = = 1(1) 

5. Job uncertainty 5(11) 5(8) — — at 10(7) 

6. Program constraints 4(9) 4(7) 8(6) 

7. Inadequate physical resources 7(16) 2(3) — 2(38) — 11(8) 

8. Noise a = = 

9. Role ambiguity 5(11) 4(7) 1G) LG) 8 (27) 14(10) 

10. Role conflict 4(9) -8(14) 5(29) = == 17(13) 


A= Specialists; B= Consultants; C=Supervisors; D= Directors; E= Assistant/Associate Superintendents. 


p. 384) asserted that role conflict occurs at lower organizational levels where 
employees are incapable of changing rules, procedures, and resources in an attempt 
to mitigate role conflict. This sense of powerlessness may explain the presence of 
role conflict for these administrators. In contrast, even though directors recorded 
the highest mean for role conflict in the fixed-response section of the questionnaire 
(Table 2), this was not mentioned as a problem area in their open-ended responses. 


Consequences of Work Stress 

Respondents in the study were invited in open-ended items to indicate the imme- 
diate effects of work stress and the long-term consequences of this stress. Some im- 
mediate effects included (a) frustration, anger, and disappointment, (b) fatigue and 
health problems, (c) job dissatisfaction, (d) inefficient work habits, and (e) little 
peer interaction. Specific comments made included the following: “jumping from 
one crisis to another,” “not as productive as I can be,” and “inefficient use of time, 
procrastination, and poor quality of service.” 


The reported long-term consequences of stress included health problems, fail- 
ure to achieve organizational and personal goals, and the loss of collegiality. 
Specialists and consultants appeared more concerned with health-related problems, 
whereas consultants, directors, and the assistant and associate superintendents 
seemed more concerned with lack of accomplishment of professional goals, and a 
deterioration in peer contact and family life. The specific comments made in- 
cluded: “loss of mission or direction,” “cynicism and mistrust,” “losing touch with 
former friends,” and “poor mental and physical health.” Many of these comments 
are representative of symptoms of burnout as identified by Welch et al., (1982), 
who mentioned the loss of willingness to relate to others, the sense of social isola- 
tion, a “tendency toward self-blame . . . [and] feelings of worthlessness” (p. 52). 
Generally, major long-term consequences of stress were reported to be chronic 
health problems, job dissatisfaction, frustration, and loss of collegiality. 
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Stress Coping Strategies 

Respondents were provided with a list of common sources of support for coping 
with stress and were asked to indicate which of these they used when experiencing 
job-related stress. On average, respondents selected three sources of support. These 
major sources of support and the responses for the five subgroups of district-based 
personnel are presented in Table 4. 


As indicated in the table, the immediate family and colleagues at work were the 
major sources of support in coping with work stress mentioned by 72% and 70% 
respectively of these district-based personnel. Two other strategies used were help 
from friends (49% of respondents) and recreational activities such as playing golf, 
fishing, or reading (48%). Seeking help from superordinates and one’s church were 
less frequently mentioned as stress-coping strategies. 


When subgroups were examined individually, specialists (A), directors (D), and 
associate and assistant superintendents (E) reported asking for help from work col- 
leagues as their major coping strategy, followed by help from the immediate family, 
engaging in extraorganizational activities, and help from friends. Supervisors (C) 
and consultants (B) used family support as a major coping strategy, followed by 
help from colleagues at work. Help from friends and extraorganizational activities 
were next in frequency of mention by consultants, whereas activities and 
superordinate assistance were the next ones used by the supervisors. A smaller per- 
centage of specialists, consultants, and supervisors turned to superordinates for 
help in handling work stress than did directors and the assistant and associate su- 
perintendents. 

When respondents were requested to indicate their satisfaction with district- 
run programs used to facilitate coping with job stress, 28% of respondents reported 
they were unaware of any stress-coping programs in the district. A further 35% in- 
dicated satisfaction with the programs, and 37% indicated dissatisfaction with the 
district’s stress-coping programs. Primarily, specialists and consultants were the 
ones most likely to be unaware of the district’s stress-coping facilities; no 
supervisors, directors, or assistant and associate superintendents indicated a lack of 
awareness. These last three subgroups also were more satisfied with the stress-cop- 
ing programs than were specialists and consultants. It is noteworthy that personnel 
in the last two categories mentioned, although district-based, worked primarily at 
or from schools in the system, whereas the remainder were located at the district’s 
central office. 


Discussion 
Because this study was a system-wide survey of district-based educational adminis- 
trators, the findings technically apply only to those respondents and to their juris- 
diction. However, to the extent that administrators in other school districts have 
work situations comparable to those of the respondents, some generalizability of re- 
sults may be warranted if the appropriate caution is exercised. 

The present study investigated sources, conditions, and consequences of stress 
among educational administrators. A conceptual model of organizational stress 
illustrated the relationships among research variables. This model was derived 
from research which suggests that an individual’s perception of the situation and 
its capacity to become threatening or challenging, harmful or benign determines 
the level of experienced stress (e.g., Anderson, 1985; Coyne & Lazarus, 1980; 


McGrath, 1976). 
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TABLE 4 
Sources of Support for Coping With Work Stress 


A B ¢ D E 

Sources of : 

Support N=(% =) 43(22) 57(43) 17(12) 6(5) 11(8)  134(100) 
1. Immediate family 28(65)  42(74) 14(82) 5(83) 7(64) 96(72) 
2. Friends 21(49) 32(56)  3(18) 4(66)  6(55) 66(49) 
3. Church 13(30) 6(11) 1(6) — 3(27) 23(17) 
4. Colleagues at work 33(77)  37(65) 10(59) 5(83) 9(82) 94(70) 
5. Activities (e.g., golf, TV) 29(67)  22(39) 6(35) 4(66) 4(36) 65(48) 
6. Superordinates 6(14) 13(28) 4(24) 3(50) 5(45) 31(28) 
7. Other 3(7) 2(4) — a = 5(4) 
8. Nosupport available 1(2) 2(4) — — — 3(2) 
9. Nohelp needed 2(5) 1(2) 1(6) = = 4(3) 


A=Specialists; B=Consultants; C=Supervisors; D= Directors; E= Assistant/Associate Superintendents. 


The findings of this study indicate that all subgroups of district-based 
educational administrators perceived work load as the most stressful aspect of their 
jobs. Personnel higher on the district’s organizational chart (assistant and associate 
superintendents) recorded a somewhat higher level of stress associated with work 
load than did the others (specialists, consultants, supervisors, and directors). Other 
major stress sources were unsatisfactory relationships with superordinates at work, 
role ambiguity, and role conflict. Work load and poor interpersonal relationships 
have been identified as major sources of stress for educators (e.g., Blase, 1982, 1986; 
Bloch, 1977; Caldwell & Doremus, 1978; Crowson & Porter-Gehrie, 1980; Farber, 
1984a, 1984b; Griffiths, 1977; Koff et al., 1980; Kottkamp & Travlos, 1986; 
McGuire, 1979; Milstein, Golaszewski, & Duquette, 1984; Warner, 1980). Nebgen 
(1978) stated that in the school context, “unanimity seldom exists among teachers, 
administrators, parents, and students” (p. 1). Welch, Medeiros, and Tate (1982) 
maintained that an educational administrator’s interactions with people are almost 
always negative because of difficult personnel problems, disputes between parents 
and teachers, and discipline problems. For this reason, these administrators are 
condemned “to at least partial failure all of the time” (p. 48). 


Research findings indicating that role ambiguity is more apparent at higher 
levels of organizational responsibility (e.g., Abdel-Halim, 1978, 1981; Mattingly, 
1977; Miles, 1976a; Schuler, 1980; Szilagyi, 1977) are only in a very modest way 
borne out by this study. The results of the present research suggest that 
experiences of stress may be role specific. For instance, directors had the highest 
mean stress score for role conflict, and assistant and associate superintendents 
recorded the highest mean for role ambiguity, possibly as functions of the specific 
roles they occupied. However, no statistically significant differences were recorded 
for mean role stress scores. 

Of some interest is the finding that district-based administrators reported 
higher levels of role stress and lower levels of stress in response to specific work 
factors compared with other educational personnel (see Table 1). These findings 
are consistent with research which indicates that experiences of stress are role 
specific (e.g., Farkas, 1984; Gorton, 1982; Koff et al., 1980; Ratsoy & Friesen, 1985). 
That is, teachers and principals may experience more work stress because of their 
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daily involvement with incessant and often unpredictable demands. In contrast, 
district-based administrators may have the necessary resources and autonomy 
needed to mitigate their sources of work stress. 


Both the quantitative and qualitative data collected in the study revealed that 
directors, consultants, and specialists, more frequently than the others, reported 
inadequate resources as a source of work stress. Although none of the differences 
among groups was statistically significant, it may be worthy of note that 
supervisors had the highest mean score for stress related to interactions with 
superordinates and with colleagues, while registering the lowest mean scores among 
the five groups for role conflict and ambiguity. Apparently, the responsibility for 
coordinating the consultancy service in the district resulted in some stressful 
encounters with personnel. This issue requires further investigation to ascertain 
what are the underlying work factors associated with these stressful encounters. 


Also of some concern were the unsatisfactory interactions among supervisors 
(C) and directors (D) in their relationships with superordinates, and the 
relationships between supervisors and colleagues in general. Identifying sources of 
stress associated with interpersonal relationships is a critical area of research 
necessary to improving organizational morale and sense of purpose. As failure to 
cope with interpersonal relationships increases, the tendency to experience stress 
and burnout also increases. For instance, Maslach (1982) claimed that poor 
interpersonal relationships can contribute to “emotional stress . . . [and] rob the 
individual of a very valuable resource for coping with and preventing burnout” 
(p. 42). 

Two major methods for coping with stress were identified by the respondents. 
Directors, superintendents, and specialists more frequently emphasized collegial 
support as a coping strategy, and consultants and supervisors relied more 
frequently on assistance from the immediate family. The help of superordinates 
was sought by less than 25% of all respondents. These findings are consistent with 
those of Swent and Gmelch (1977, pp. 33-35), whose study of Oregon school 
administrators revealed that sharing problems with colleagues and the acquisition 
of interpersonal and management skills were major coping strategies. Shinn (1982, 
p. 79) asserted that group and organizational coping mechanisms are more effective 
than individual coping techniques. Administrators in this study appeared to favor 
group coping strategies over personal strategies such as avoidance or palliation 
through increased dependence on alcohol or other forms of depressant. 


The results of this study also indicated that recreational activities served as a 
coping mechanism. This finding supports research by Howard, Rechnitzer, and 
Cunningham (1975, p. 325), who revealed that regular sleep, exercise, and good 
health habits proved effective coping strategies for company managers in Ontario. 
Because stressors are ubiquitous, school administrators and central office personnel 
need to learn how to identify those coping strategies most effective in dealing with 
individual work stressors, or with any combination of work stressors. Such training 
in the identification of potential work stressors can serve to stimulate employee 
psychological and physiological well-being, while also enhancing organizational 
morale. 

Overall, 35% of respondents were satisfied with the stress-coping programs in 
the district, while 37% were dissatisfied. This finding indicates that measures 
should be taken to examine the utility of such district-based programs for district- 
based administrators, and to suggest alternative approaches in dealing with work 
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stress. Are the programs too intimidating or lacking in sufficient anonymity? These 
questions need to be addressed, and other reasons need to be examined. The 
apparent lack of confidence in superordinates as a form of social support in coping 
with work stress may not come as a complete surprise. Why would one go to one’s 
boss for help in coping with work-related stress? Might this not be interpreted as a 
sign of incompetence by the superordinate? However, for the good of the 
organization, there may be an advantage in knowing that personnel in the “lower 
ranks” are experiencing stress as a result of work load or other role-related factors 
due to the actions of those “higher up.” 


The current belief among researchers is that educators are not suffering unduly 
from work stress, at least when compared with certain high stress occupations. 
However, any form of distress in education is a matter of concern. As the personal 
comments of respondents indicate, work stress is implicated in the development of 
ill health, frustration, job dissatisfaction, a declining sense of personal achievement, 
and a deterioration in interpersonal relationships with peers and family. 


Fergusson (1984, p. 97) stated that educator stress and its implications for 
individuals and school systems requires immediate attention. Friesen and Williams 
(1985, p. 31) note that there is a need for further study and development into the 
causes and consequences of stress in education. The authors of this paper agree 
with this need for more research on stress in education. Using the model of 
organizational stress developed for the purposes of this paper, we have attempted 
to highlight how specific intraorganizational stressors accounted for differential 
stress responses among district-based educational administrators. Sources of coping 
with work stress were identified, and both immediate responses to and long-term 
consequences of work stress were outlined. Further examination of the conceptual 
model underlying this research is needed in order to assess more fully the 
effectiveness and utility of specific coping strategies in dealing with work stress. 
Particular attention should focus on whether coping strategies are more effective 
prior to the immediate stress response, or as intervening variables between 
immediate effects and long-term responses. In this regard, and as reported in this 
article, social support as a buffer to immediate stress effects and a moderator of 
prolonged stress needs further research, specifically in an educational context. 
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Teacher Expectations Regarding the 
Impact of Two Staff Reduction Policies 


School administrators in British Columbia and other jurisdictions have been faced 
with the difficult task of reducing the size of their teaching force. Although knowledge 
of the teachers’ opinions regarding the various reduction-in-force options would help 
administrators in their choice of action, little research has been conducted on 
teachers’ attitudes toward human resource policies. This study examined the 
opinions of a sample of teachers in two British Columbia school districts regarding 
their outcome expectations and overall evaluation of two reduction-in-force actions. 
One alternative included an early retirement program while the other did not. The re- 
sults indicated that the teachers sampled had a neutral opinion of the option with 
the early retirement program and viewed the option without early retirement even 
less favorably. This pessimism was also reflected in the organizational outcomes they 
expected to result from each of the two options. Overall, opinions were associated 
most strongly with two possible outcomes: the general quality of education and teach- 
er morale. 


With decreasing enrolments and tighter government fiscal policies, many Canadian 
and US school districts have recently been faced with the need to reduce the num- 
ber of teachers that they employ. In some areas, the magnitude of these reductions 
has been quite dramatic. In some districts of British Columbia, for example, school 
enrolment has decreased by over 30% from the peak experienced during the late 
1970s. In addition, the provincial government’s financial restraint program an- 
nounced in 1982 has established higher pupil-teacher ratio targets for elementary 
and secondary school systems. The result for most school districts in British Co- 
lumbia has been an oversupply of teaching staff. Administrators addressed this 
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problem first by trying to increase their budgets through negotiations with the pro- 
vincial government and, failing that, by reducing other costs in order to avoid re- 
ducing the teaching complement. Increasing revenues through local taxation has 
also been considered, but school boards have been reluctant to move in this direc- 
tion for a variety of reasons, including the current economic recession. Even with 
capital cost reductions, it has been necessary to reduce the number of teaching 
staff in several British Columbia school districts. 


School district administrators have a number of options available to reduce the 
number of teaching staff (Bishop, 1979; Cuban, 1979; Luce, 1983). First, there are 
policies which temporarily reduce the number of full-time equivalent teachers 
while still keeping the original number of teachers on the payroll. Depending on 
contractual obligations and other constraints, administrators can institute job shar- 
ing arrangements whereby two or more teachers “share” one position (Growly, 1972; 
Meltz, Reid, & Swartz, 1981). Alternatively, an agreement could be reached where 
all staff reduce their hours of work. Voluntary temporary layoffs or leaves of 
absence could also be encouraged, particularly by allowing the affected teachers to 
continue their participation in the pension plan and other employee benefits 
(Scharffe, 1983). 


Various human resource policies can also be considered which permanently re- 
duce the number of teachers in the school district. A hiring freeze is one of the 
least painful options of this sort because the number of teachers is reduced only 
through attrition. Unfortunately, as one study discovered (Luce, 1983), turnover 
declines so dramatically during times of restraint that this option rarely satisfies all 
the staff reduction needs of the organization. Voluntary terminations, usually in 
the form of “buyout” packages, have also been offered in order to reduce staffing 
costs. 

More commonly, administrators have turned to permanent layoffs, even though 
this option is one of the least popular (Phelan, 1983). Layoff criteria vary. Teacher 
associations argue that administrators should rely solely on seniority as the deter- 
mining factor. Alternatively, these decisions are sometimes determined by the 
teacher’s performance evaluations and the school district’s needs in specific subject 
areas. 

Finally, early retirement policies have been introduced. Here the organization 
encourages teachers to retire before their normal superannuation date by offering 
special financial incentives. These incentives may include a few years’ salary or a 
top-up of pensionable earnings for those near retirement age. 

In determining the best manpower policy for reduction in force, superinten- 
dents and other school administrators must carefully examine both the short-term 
and long-term implications of these actions (Flynn & Niven, 1982). In particular, 
human resource practices can affect the performance, commitment, and satisfac- 
tion of teaching staff, the degree of cooperation and conflict throughout the school 
system, and the overall quality of the educational system. Some of these problems 
may become immediately apparent while others emerge only after several years if 
the human resource practice is continued. 

The immediate impact of reduction-in-force and other human resource policies 
is usually clearly recognized by administrators. For example, it is evident that lay- 
offs create uncertainty regarding job security. This uncertainty tends to demoralize 
teachers and increases the incidence of burnout (Phelan, 1983; Scharffe, 1983). 
There is also evidence that resource scarcity generates conflict (Schmidt & Kochan, 
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1972). Specifically, Whetton (1980) has suggested that organizational decline tends 
to exacerbate interpersonal and interunit conflict. This conflict is likely to occur 
with across-the-board reductions in particular, because the scarcity is spread 
throughout the entire organization rather than localized within a few subunits. De- 
pending upon which reduction-in-force practices are used,and how effectively they 
are implemented, a significant loss of motivation and commitment amongst teach- 
ing staff can also result (Levine, 1984; Luce,1983). 


Although the choice of reduction-in-force policies such as hiring freezes, early 
retirement, and layoffs are typically decided on the basis of the immediate or 
short-term organizational outcomes, the long-term demographic changes that these 
policies create should not be overlooked. Indeed, the long-term demographic conse- 
quences can be much more tenacious and costly to the organization than the more 
immediate concerns. Organizational demography refers to the composition of a 
group of employees with respect to their age, sex, educational level, skills, length of 
service, and other features (Pfeffer, 1983). The demographic structure of employees 
is affected by the flow of people into, through, and out of the organization. Human 
resource policies and practices which alter these flows ultimately alter the shape of 
the organization’s demography in the future. For example, an increase in the rate 
of hiring reduces the median tenure of employees, whereas early retirement and 
seniority-based layoff practices increase the median tenure. These actions may also 
alter the demographic features of employees across hierarchical levels and thereby 
may change future promotions within the system (Kefitz, 1973; Rosenbaum, 1979; 
Stewman & Konda, 1983). 


There is increasing evidence that the demographic structure of an organization 
(or a group of employees within the organization) can have a profound impact on 
organizational effectiveness. Katz (1982) has recently explained how the 
innovativeness and flexibility of work groups tend to decrease as the average tenure 
of members in the group increases. In this respect, Husby and Riffel (1979) re- 
ported that older teachers tend to be less innovative in their methods and some- 
what less open to the concerns of students. The authors explained that this 
situation tends to be exacerbated during times of restraint because the more ener- 
getic and successful teachers are more likely to leave the school district. In a re- 
lated study, Dembiski and Gay (1980) reported a negative correlation between the 
average age or experience of a group of teachers and the achievement of students 
taught by that group. They concluded that school districts should frequently re- 
plenish their human resource complements with younger teachers in order to gen- 
erate new energy and innovative ideas. 


The loss of more senior teaching staff may also have undesirable consequences 
for the school system. In one survey of British manpower reduction practices, 
employers in several industries regretted losing so many senior people through 
early retirement (Flynn & Niven, 1982). They discovered that the younger age 
structure of the work force created by the early retirement policy had weakened 
the stability of the organization. Several writers have explained that more senior 
employees tend to provide the social foundation from which corporate values are 
maintained and transmitted (McNeil & Thompson, 1971; Ouchi & Jaeger, 1978). 
These values act as a form of organizational control which stabilize the activities of 
the workforce. When a large proportion of the senior staff leave, this form of 
stability and control weakens. 


The level of interpersonal and intergroup conflict throughout an organization 
can also be affected by the demographic structure of its workforce. McCain, 
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O'Reilly, and Pfeffer (1983) reported that earlier hiring practices led to large age 
gaps and length of service gaps among faculty in some departments of a large uni- 
versity. These gaps tended to generate conflict due to the differences in values held 
by the various cohorts within each department. Thus, in departments with large 
length of service and age gaps, conflict was higher, and this led to a higher rate of 
turnover. In terms of manpower policy, this finding suggests that reduction-in-force 
actions such as hiring freezes and seniority-based layoffs can lead to gaps in the 
demographic structure of teachers within a school system, which can lead to 
dysfunctional conflict. 


Given the complex implications of reduction-in-force policies on the school sys- 
tem, administrators may wish to consider the expectations that teachers have re- 
garding the various options being considered. In particular, a better understanding 
of how teachers evaluate the impact of layoffs, early retirement, and other actions 
can help administrators assess teacher reactions to these policies as well as identify 
other outcomes which they had not recognized previously. This additional informa- 
tion would ultimately assist the decision-maker in choosing the best reduction-in- 
force option under the circumstances (Bass, 1983). So far, there has been very little 
published research on teacher attitudes and expectations regarding various man- 
power options, particularly where those options are being actively considered and 
would directly affect the teachers being surveyed. 


The main objective of the present study is to identify the preference for an 
early retirement program among a sample of teachers in two British Columbia 
school districts, and to explore their expectations regarding the demographic and 
organizational consequences of reducing the teaching force with and without the 
early retirement program. The analysis centers around two sets of basic research 
questions. First, which of the two reduction-in-force options (i.¢., with or without 
early retirement) is preferred, and which organizational outcomes most influence 
this preference? The answer to these questions will enable administrators under 
conditions of retrenchment to more clearly understand teachers’ preferences and 
the problems they anticipate with or without an early retirement program. The sec- 
ond set of questions which this study addressed is: Which demographic changes do 
teachers expect will result from each option over the next five years, and how accu- 
rate are these expectations? Analysis of this issue will help us to determine how 
valid the teachers’ beliefs are, at least with respect to anticipated changes in the 
demography of the teaching force. 


Method 

Research Sites 

The data for this study were gathered from a sample of teachers in two large school 
districts in the Lower Mainland of British Columbia. As part of a larger research 
project on human resource planning in the public school system, these two school 
districts were invited to participate because both were facing a significant 
oversupply of teachers over the next few years, and both were in the process of de- 
termining how to best reduce the size of their respective teaching forces. One of the 
districts employed approximately 520 elementary and 390 secondary teachers dur- 
ing the 1983-1984 school year. The other district is a slightly larger jurisdiction 
which employed approximately 550 elementary and 420 secondary teachers during 


the 1983-1984 school year. 
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In both districts, administrators were deciding whether to continue an early re- 
tirement program which had been first introduced two years earlier in the two 
school districts. In other words, they were essentially choosing between two 
reduction-in-force options. The first option called for the continuation of an early 
retirement program for the next five years. The surplus number of teachers would 
also be reduced through normal attrition with a general hiring freeze, except for ex- 
ceptional needs (e.g., French immersion). If necessary, the supply of teachers would 
be further cut by the nonrenewal of contract personnel. Only when all contract 
teachers had been laid off would continuing teachers be laid off. Although it was 
hoped the last stage of this first option would be avoided, the administrators had 
indicated that seniority would be the most likely criterion. The second option fol- 
lowed the same stages as the first except that there would be no early retirement 
program. Thus the number of teachers would first be reduced through attrition, 
followed by the nonrenewal of contract teachers, and finally the layoff of continu- 
ing teachers on the basis of seniority. For the sake of simplicity, the two alterna- 
tives will be labeled as the early retirement and no early retirement options, 
respectively. 


The participants in this study included teachers from eight elementary and two 
secondary schools in the two school districts. The schools were not sampled ran- 
domly, but were selected on the basis of their representativeness and availability to 
participate. An 11th school declined to participate, ostensibly because the principal 
felt that the survey questions would further upset the teachers. The two secondary 
schools were fairly large, each employing between 70 and 85 teachers. The elemen- 
tary schools were much smaller with approximately 15 to 30 teachers in each. 


The data were collected by a questionnaire administered to all teachers in at- 
tendance at general meetings in each of the ten schools. The researchers first ex- 
plained the purpose of the study and described the two policy options to the 
teachers. Both options were also described in the questionnaire. At the end of the 
meeting, the teachers handed in the completed questionnaires anonymously. Over 
60% of the teaching staff in each school (and virtually all of those in attendance at 
each meeting) completed the questionnaire. The final sample consisted of 217 
teachers; 124 were from District A and 93 were from District B. 


Measures 

The questionnaire consisted of three sections for each of the two options. In the 
first section, the teachers were asked to indicate what changes in the demographic 
characteristics of the teaching staff would result from each option over the next 
five years. Eleven potential demographic outcomes were listed in this section (e.g., 
“average seniority of teachers”) and the answers were marked on a 5-point response 
format ranging from “much higher” to “much lower.” The second section asked the 
teachers to assess what effect each option would have on several aspects of the edu- 
cational system over the next five years. Thirteen possible organizational outcomes 
were listed in this section (e.g., “quality of graduating students”) and these answers 
were also indicated on a 5-point response format ranging from “much higher” to 
“much lower.” The final section consisted of a single item which asked the respond- 
ents to indicate on a 5-point scale how favorably or unfavorably they viewed that 
particular option overall. The three sections of questions were then repeated for 
the second option. 
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The items on potential demographic and organizational outcomes were based 
on several sources of information. A number of items were identified from the liter- 
ature on organizational demography, organizational decline, and human resource 
planning. School administrators in the two districts as well as in a smaller third 
school district were also interviewed in order to identify their expectations. In one 
of the school districts, a special joint committee of administrators, principals, 
trustees, and representatives from the teachers’ association was consulted, and an 
early draft of the questionnaire was reviewed. In the other district, the administra- 
tion, teachers’ association, and principals in the selected schools were approached 
separately. Some items also resulted from personal interviews with several teachers 
and from a pretest of an earlier questionnaire on a sample of teachers. (These 
teachers were not included in the final sample.) 


Results 
Demographic Outcome Expectations 
Prior to examining the teachers’ overall opinions and expectations of specific or- 
ganizational outcomes of each option, data on their expectations regarding demo- 
graphic changes are presented. The objective of this is to identify the type of 
demographic changes anticipated by the sample and to compare these expectations 
with the results of a computer-based model (described below). To the extent that 
the computer-generated projections are accurate, this will allow us to examine the 
validity of the teachers’ perceptions. 


Table 1 presents the teachers’ initial expectations of the impact of the two 
manpower options on the demographic structure of teachers in the school system 
over the next five years. The respondents generally believe that the percentage of 
teachers over 50 years of age would increase and the percentage under 35 years 
would decrease, no matter which reduction-in-force policy option was chosen. How- 
ever, the teachers judged that the early retirement option would result in a smaller 
increase in the proportion of teachers over 50 and a smaller decrease in proportion 
of teachers under 35 than the option which excluded the early retirement program. 
Consistent with these observations was the expectation that the average seniority 
of teachers in each school district would increase much more if the early retirement 
option was not chosen. In other words, the continuation of the early retirement 
program was judged to significantly moderate the aging of the teaching work force 
in these two districts. 

Table 1 also indicates that teachers expected significantly more continuing (i.e., 
permanent) teachers to be laid off and significantly fewer contract (i.e., temporary) 
and continuing teachers to be hired if the early retirement program was not used to 
reduce the size of the work force. In contrast, the respondents anticipated that 
there would be a slight increase in both the percentage of teachers going on a leave 
of absence and the number of contract teachers who would not receive a further 
contract, no matter which reduction-in-force option was implemented. 

It should be noted that teachers were generally pessimistic about both options. 
Generally, they expected an increase in continuing teacher layoffs and a reduction 
in both contract and continuing teacher hirings under either scenario. Finally, the 
survey participants anticipated a slightly smaller decrease in the percentage of con- 
tract teachers and a slightly larger increase in the percentage of part-time teachers 
under the early retirement option. The increase in the percentage of people teach- 
ing outside their primary subject area was predicted to be about the same under 


both options. 
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TABLE 1 


Expected Demographic Outcomes of Human Resource Options 


With Early No Early 
Retirement Retirement 
Demographic Outcome Mean S.D. Mean S.D. t-value 
1. Percent Over 50 Years 3.49 LO 4.33 0.86 SNMOQOR 
2. Percent Under 35 Years 2.45 ald) 2.08 12OF AUTO pees 
3. Average Seniority 3.54 0.94 4.05 0.80 —n00** 
4. Percent Going on Leave 3.14 0.97 3.24 1.08 —le29 
5. Non-renewal of Contract Teachers 3.35 33 3.30 1.46 0.41 
6. Number of Continuing Teachers Shes) 0.94 3.69 0.99 — yates 
Laid Off 
7. Number of New Contract Teachers 2.48 ifs Pols 1.10 Shier 
Hired 
8. Number of New Continuing 2.14 1.00 ip 0.89 Ole 
Teachers Hired 
9. Percentage of Teachers Who Are on v7 ks) 1.18 2.50 1.29 2.46* 
Contract 4 
10. Percentage of Teachers Who Are 3.63 0.87 3.48 1.01 2:30* 
Part-Time 
11. Percentage of Teachers Teaching Seo)t 0.85 4.01 0.81 == OG 


Outside their Primary Subject Area 


* p<.05; ** p<.01; *** p<.001. N ranges from 208 to 217. 


Response Range: 5=much higher, 1 = much lower. 


Many of the demographic changes predicted by these teachers can be compared 
with human resource projections developed by the authors on each of the two 
school districts. As part of the mandate of the research project, the project team 
gathered data from personnel records on the demographic characteristics of 
teachers in each school district over each of the previous four years. From this 
information, estimates of human resource flows into, through, and out of each 
district were established and computed as age-dependent probabilities for 
elementary and secondary teachers in each district. The probabilities of a teacher 
entering or leaving the system in a variety of ways were estimated using historical 
patterns. These flow parameters were then entered into an advanced stock-flow 
computer simulation model developed by the Institute of Manpower Studies (1983) 
in Great Britain. The model is similar to the method described by Mahoney, 
Milkovich, and Weiner (1977) except that separate probabilities are computed for 
each age group for greater precision. 


The results of the computer simulation paralleled the expectations of the 
teachers in many respects.! The projections for both school districts indicated a 
general aging of the work force under both scenarios, although the increase in the 
proportion of teachers over 50 would be smaller under the early retirement option. 
The respondents also correctly indicated that there would be much lower hiring 
levels of contract and continuing staff and an increase in the number of contract 
teachers whose contract would not be renewed. While fairly accurate in many 
respects, the teachers’ expectations were somewhat more pessimistic than the 
computer-based projections regarding the layoff of continuing staff. The human 
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resource planning model projected only a few layoffs (i.e., 20-30) of continuing 
teachers in the secondary system of one of the districts between 1986 and 1989 if 
early retirement is not continued. However, the same district is projected to have 
an equivalent shortfall of elementary teachers during that period of time. Through 
selective transfers, the layoff of any continuing teacher could be avoided under 
either policy option.’ 


Overall Favorableness and Organizational Outcome Expectations 

One of the central concerns of this study was to identify the teachers’ preference 
for an early retirement program and to find out which specific outcomes seem to 
influence this preference. The respondents’ overall favorableness to each option 
was marked on a 5-point anchored scale. The early retirement option was given, on 
average, a slightly better than neutral response (K=3.19, SD=1.22) whereas the 
option without early retirement was generally perceived to be somewhat unfavor- 
able (X=1.96, SD=0.88). The early retirement option was rated significantly 
higher than the no early retirement option (t=12.25, p<.001). However, it is evi- 
dent that the participants in this study are not enthusiastic about either manpower 
policy. 

As was described earlier, 13 potential outcomes of various reduction-in-force 
actions were identified from the literature as well as from discussions with adminis- 
trators and teachers. The teachers in the present study were asked whether there 
would be more or less of each of these outcomes under each of the two policy con- 
ditions over the next five years. The results, which are shown in Table 2, could as- 
sist administrators who are contemplating either of these actions in anticipating 
the specific concerns that teachers may have. 

As the mean scores for all 13 outcomes in Table 2 indicate, the respondents 
have a generally gloomy outlook on the future of their organizations over the next 
five years, no matter which manpower option is chosen. There are differences in 
the teachers’ judgments of the two options, however, which led them to favor the 
early retirement option. For example, they anticipated that the general quality of 
education and of graduating students would decrease, but not quite as precipi- 
tously, under an early retirement program. 

A similar pattern emerged regarding the respondents’ judgments of the six 
outcomes regarding teaching staff. Teacher morale, skills, commitment to students, 
involvement in extracurricular activities, and overall effectiveness were expected to 
decline and the incidence of teacher burnout was expected to increase. However, 
the introduction of an early retirement program was believed to significantly atten- 
uate the severity of these changes. In particular, Table 2 indicates that the infusion 
of new skills, knowledge, and talent would be less adversely affected by the inclu- 
sion of an early retirement program. 

Table 2 also indicates that teachers expect the reduction-in-force actions with 
early retirement to improve only marginally the level of cooperation or reduce the 
level of conflict than the actions without early retirement. Specifically, cooperation 
between teachers and administrators and, to a lesser extent, relations between 
teachers and students are expected to be less severe with early retirement. How- 
ever, these improvements are marginal compared with the changes reported above 
regarding the teaching staff and educational system. 

In order to identify the relationship between the specific organizational 
outcomes and the overall favorableness ratings for each option, two stepwise 
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TABLE 2 


Expected Organizational Outcomes of Human Resource Options 


With Early No Early 
Retirement Retirement 
Organizational Outcome Mean SD: Mean S.D. t-value 
A. The Educational System 
1. General Quality of Education 2.52 0.91 2722 .79 7 (Si 
2. Comprehensiveness of Educational aol 0.87 2.25 0.74 1.08 
Services Offered 
3. Quality of Graduating Students 2.68 0.73 Deisy) 0.70 SHOR 
B. Teaching Staff 
4. Morale of Teaching Staff PRN 1.01 Eom 0.75 30275 
5. Infusion of New Skills, Knowledge, 2.41 ili 1.88 0.84 (eehe* 
and Talent 
6. Teacher Commitment to Student 2.70 0.79 2.49 0.69 3495" 
Development 
7. Teacher Involvement in Extra- Dog 0.96 2.02 0.78 Doma 
Curricular Activities ‘ 
8. Incidence of Teacher Burnout 3.66 1.18 3.98 1.07 = 3.04"** 
9. Overall Teacher Effectiveness 2.52 0.86 2.26 0.76 ADRs ae 
C. Cooperation and Conflict 
10. Conflict Among Teachers and 3.02 0.81 3.47 0.81 0.62 
Department 
11. Cooperation Between Teachers and 2e1o 0.69 2.59 0.68 3.09** 
Administrators 
12. Relations Between Teachers and 2.76 0.69 2.62 0.71 2.47* 
Students 
13. Conflict Among Schools Within the rep 0.69 3.34 0.67 — his 
District 


*p<.05; **p<.01; ***p<.001. N ranges from 208 to 217. 


Response range: 5= much higher, 1 = much lower. 


regression analyses were conducted. Each stepwise equation calculated the weight 
of the organizational outcome as a predictor of the overall favorableness rating for 
each reduction-in-force option. The results of this analysis showed that the general 
quality of education and the morale of teaching staff were the only two 
organizational outcomes listed in Table 2 which uniquely explained variance in 
each of the two human resource options. One-quarter of the variance in the overall 
evaluation of the early retirement option and one-third of the variance in the no 
early retirement option were explained by these items. 


Discussion and Conclusions 

The findings reported in this article indicate that the teachers surveyed were 
comparatively more favorably inclined towards the human resource policy with an 
early retirement program than the same set of actions without early retirement. In 
a more absolute sense, on the other hand, the respondents were not very 
enthusiastic about the consequences of either option. The early retirement choice 
averaged slightly above neutral on the scale of overall favorability while the no 
early retirement choice rated much lower. 
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These negative expectations were also reflected in the teachers’ opinions of the 
specific organizational outcomes resulting from either option. They felt that an 
early retirement program would attenuate the adverse impact of retrenchment of 
the performance, commitment, and satisfaction of the teaching staff, but the 
changes would still be for the worse. Similarly, the general quality of education and 
the quality of graduating students were expected to decrease over the next five 
years, although not quite as precipitously under the early retirement option. 

These observations reflected the general pessimism that many public school 
teachers in British Columbia felt at the time of this research. Many felt closed in 
on one side by the government cutbacks and on the other by only modest public 
support for, and understanding of, the teachers’ predicament. Possibly a more 
favorable policy option exists in the minds of teachers, but the two reduction-in- 
force alternatives included in this study were identified by school administrators as 
the most feasible. 


Based on the multiple regression results, the respondents’ overall evaluations of 
the two reduction-in-force options were influenced mainly by the anticipated 
changes in the general quality of education and teacher morale. This suggests that 
the teachers’ concerns are balanced between the school system and their own 
welfare, rather than totally influenced by one or the other. This issue of 
cooperation and conflict did not figure prominently in their overall opinions of 
these two policy alternatives. 

Another interesting finding that emerged is that the participants’ expectations 
regarding the demographic changes resulting from each human resource option 
were comparable to the projections generated from the computer-based human 
resource planning model. However, they were also more pessimistic about the 
layoff of continuing teachers than the computer estimates indicate. To the extent 
that the computer projections are valid, the teachers in this sample appear to have 
reasonably accurate perceptions of the demographic alterations resulting from 
specific human resource actions. Only a general sense of pessimism tends to bias 
their estimates slightly. 

There are grounds for believing that these findings are generalizable to other 
school districts facing the need for substantial reductions in their teaching 
complements. In this study, elementary and secondary teachers in two school 
districts were included in order to increase the generalizability of the results. The 
findings were substantially the same for each of these groups. The teachers all 
faced various degrees of restraint as well as an aging workforce, and therefore saw 
early retirement as a relatively desirable human resource option. Where other 
school districts similarly have an aging teacher population and face the need to 
reduce the number of teaching staff, the findings reported here may be quite 
relevant. Future research on teacher preferences and expectations is required, 
however, to better understand the implications of early retirement and other 
human resource policies under different demographic and environmental 


conditions. 
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Notes 
le Due to space limitations, the actual computer projection results are not shown here. They are available from the authors on 
request. 
2. The reader should be cautioned that the computer model does not take into account skill differences among teachers. Due 


to an oversupply of certain skills (especially in secondary schools), layoffs may be necessary if retraining is not possible. 
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An Examination of Sex Differences in 
Secondary School Students’ Attitudes 
Toward Writing and Toward Computers 


In order to examine the basis of the assumption that writing-related computer expe- 
riences might be an effective way to improve the attitudes of adolescent females 
toward computers, attitudes of secondary school males and females toward writing 
and toward computers were measured and compared. Consistent sex differences were 
found, with females more positive about writing than their male peers but less posi- 
tive than males with respect to attitudes toward computers and computer users. The 
results support the use of writing as a focus of computer literacy experiences for sec- 
ondary school females, in that the use of the computer as a writing tool may serve to 
transfer the positive feelings that females have about themselves and writing to the 
attitudes that they develop about themselves and computers. 


Sex differences in secondary school students’ attitudes toward computers and in 
their participation in computer-related studies and activities are a subject of com- 
ment and concern (Bakon, Neilson, & McKenzie, 1983; Chen, 1985; Schubert, 
DuBois, & Wolman, 1985). Most of the evidence upon which concern is based is 
either data describing sex differences in enrolment in computer science courses or 
anecdotal observations (“Many teachers are noticing that most of the students who 
come to the computer room during their free time are boys,” Sanders, 1985, p. 23). 
It is assumed from this type of evidence that females have less positive attitudes 
about computers than do males, although little empirical work has been done to 
support this assumption (Sanders, 1984). Lockheed (1985) summarizes the current 
evidence about gender association and computers by stating that “recent data on 
sex differences in computer use at home and at school support the idea that our 
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culture is defining computers as predominantly male machines” (p. 116). She, like 
others, warns that “dire consequences” can befall females who are “left behind in an 
increasingly technological society” (p. 120). 

To help prevent these dire consequences, many individuals and agencies have 
already produced intervention strategies the aim of which is to increase the amount 
of voluntary computer use for females as well as improve their attitudes toward 
computers. One of the strategies which is frequently recommended is to promote 
word processing as a focus of school-based experiences (Lockheed & Frakt, 1984). 
Word processing is perceived as likely to be more attractive to females than the 
typical programming orientation of secondary school computer studies (Lockheed, 
Nielsen, & Stone, 1983). These recommendations are based on the assumptions 
that females have a positive attitude about writing and that this positive 
orientation will transfer to their computer experiences. The systematic measure- 
ment of the attitudes of secondary school students toward computers and toward 
writing is a necessary first step in the investigation of these assumptions. There 
has, however, been little systematic development of measurement instruments for 
these purposes. 

Writing apprehension, as a form of negative attitude toward writing, has been 
studied by Daly (1979) and Daly and Wilson (1983). The latter found males to be 
more apprehensive about writing than are females. Price and Graves (1980) also 
note sex differences in writing attitude and cite “the stereotype of female 
superiority with language” (p. 151) as contributing to relatively negative attitudes 
toward writing in males. They conclude that such negative attitudes will affect 
writing achievement “thus creating a problem that teachers will have to attack at 
the roots—the attitudes themselves” (p. 152). Overall, however, in the field of writ- 
ing research, studies of writing attitudes are rare and writing-attitude scales are not 
readily available. The Third National Writing Assessment (NAEP, 1980) is one of 
the few major sources of writing-attitude items. 


Similarly, only a few scales that deal with secondary school students’ attitudes 
toward computers have appeared in the literature (Minnesota Educational Com- 
puting Consortium, 1979; Ellsworth & Bowman, 1982). The majority of discussions 
involving sex differences in-secondary school students’ attitudes toward computers 
are based primarily on informal observations. 


This study therefore had two levels: first, to develop appropriate instrumenta- 
tion to measure secondary school students’ attitudes toward writing and toward 
computers; and second, to use data obtained from these measures to study the fol- 
lowing questions: (a) Are there sex differences in secondary school students’ atti- 
tudes toward computers or in their attitudes toward writing? and (b) Are 
secondary school females more positive and self-confident about writing than they 
are about computer activities? The overall purpose of the study was to provide 
specific evidence with which to address the assumption that word processing expe- 
riences might provide a remedy for the “dire consequences” facing secondary stu- 
dents, particularly females, who are uncomfortable with technology. 


Method 
Subjects 
There were two major parts of the study. One involved one group of students in the 
development and implementation of a writing attitude assessment and the second 
involved three additional groups of students in the development and 
implementation of a computer attitude assessment. 
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TABLE 1 


Student Samples for Writing and Computer Attitude Assessments 


ee eeeeeeeSsSs—esesees 


Attitude Focus Male Female 
Writing Grade9 Grade 12 Grade9 Grade 12 
Assessment® 
56 91 ae: 85 
2 ee ee eee a i ee es Os 
Computer Grade 8 Grade 12 Grade 8 Grade 12 
Assessment 
ee ee ee es eee eee 
Phase 1 43 36 49 35 
Phase 2 246 237 197 206 
Phase 32 539 419 479 381 


N.B.a. School District A 
b. School District B 


A total of 3,014 secondary school students in Grades 8, 9, and 12 in two school 
districts in British Columbia were involved in the overall study. Table 1 
summarizes the student sample. 


The writing assessment study and phase 1 of the computer assessment study 
involved 18 intact classes of eight different junior and senior secondary schools in 
the two school districts. The second phase of the computer attitude study involved 
all the Grade 8 and Grade 12 students in one of the two school districts. The third 
phase involved a random selection of approximately 50% of the Grade 8 and Grade 
12 students from the second district. More than 70% of the students in the 
computer attitude study had had some experience with computers in school; almost 
all of this experience was in the context of computer literacy units within 
mathematics classes or elective computer science courses. 


Procedure 

Instrumentation. Two attitudinal question sets were developed in this study. To 
measure attitudes toward writing, 10 items were selected from the Third National 
Writing Assessment (NAEP, 1980) and an additional 20 statements were con- 
structed by the researchers based on previous reviews of the literature in both 
reading and writing (Ollila & Michaux, 1983). All items were accompanied by a 
Likert-type scoring scale, with “1” designating “strongly agree” and “5” designating 
“strongly disagree.” Negatively worded items were recoded prior to analysis so that 
a low score on each item represents a more positive attitude than does a high score. 
The items, and summary statistics relating to the items, are given in Table 2. 

Data from this administration of the writing attitude survey were analyzed to 
select a subset of the items most representative of underlying trends in the student 
responses. Principal components factor analysis was used for this purpose. Three 
factors with eigenvalues greater than 1.00 were extracted and were interpretable, 
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TABLE 2 
Means of Responses to Writing Attitude Survey 


Item oe Grade Grade 
9 Female 9 Male 12Female 12Male 


1. [like to write down my ideas. 3.092 3.07 2.11 2.45 


2. Lam no good at writing. (R) 2.70 2.83 2.24 2.05 
3. Expressing ideas through writing seems to be a 2.28 Dale 1.69 1.81 
waste of time. (R) 
4. People seem to enjoy what I write. 2.81 3.36 Pratl 2.74 
5. Iexpect to do poorly in composition classes 2.83 3.07 2.36 2.07 
before I take them. (R) 
*6. [look forward to writing down my ideas. 3.49 3.44 2.70 28 
7. Iwrite for other reasons besides school. 2.42 3.16 2.29 2.82 
*8. When I hand ina composition, I know I’m going 2.44 2.43 PNG) 1.89 
to do poorly. (R) 
*9. Tenjoy writing. DIST 3.42 Qaly, 2.44 
10. Iam afraid of writing essays when I know they 2.67 2.70 2.70 2.42 
will be evaluated. (R) 
*11. I feel confident in my ability to clearly express eal! 3.50 Qe 2.42 
my ideas on writing. 
12. Lavoid writing. (R) PT 3.05 Plat 29 
13. If I had my way, everybody would study writing. 3.91 3.60 3.60 3.74 
14. Writing is one of the most useful subjects I 2.81 3.34 2.52 2.65 
know. 
15. All people should be good at writing. 2.87 3.05 2.89 2.98 
16. Writing is very interesting. Sele 3.30 2.30 2.59 
17. Writing might be worthwhile if it were taught 2.93 ZH Syl 3.65 
correctly. (R) 
18. I wouldn’t bother writing if I didn’t have to. (R) 2.91 2.91 22 2.45 
19. I can’t see how writing can help me. (R) 2.36 2.30 1.58 il (A 
20. Ireally enjoy writing. 2.68 3.61 239 2.68 
21. Ilike to work hard at writing. 3.16 3.66 252) 2.96 
22. Ifind writing useful at home. 3.38 3.48 2.46 3.05 
*23. [sometimes do extra work in writing at home. 3.18 3.62 2.81 3.36 
24. I can’t see how writing will be useful to me 1.98 2.31 1.64 1.79 
outside of school. (R) 
25. Writing teaches me to be accurate. 2.93 3.01 2.29 2.41 
26. Writing is a waste of time. (R) Diol 2.84 1.47 haga 
27. Writing is the best subject in school. 3.54 3.82 3.29 3.74 
28. My parents write at home. 2.61 2.55 2.88 2.58 
29. Iwill write after I finish school. 2.96 389 2.34 2.43 
30. Being a good writer will help me in my adult life. 2.43 2.66 2.08 1.89 
Notes: 


a. The lower the score, the more positive the attitude. 
b. An asterisk (*) indicates the variable was selected for the final survey instrument. 
c. (R) indicates the scoring was recoded so that strongly disagree= 1. 


having loadings greater than .700 after varimax rotation. The factors were labeled 
Enjoyment, Self-Confidence, and Utility of Writing. The item with the strongest 
loading on each of these rotated factors was selected for inclusion in the final 
survey instrument. In each case, the item was judged to be representative of the 


interpretation given to the retained factor. These items are indicated with asterisks 
in Table 2. 
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The set of computer attitude items was developed using McKennell’s (1974) 
3-phase model for surveying attitude structures. The first phase of this model 
requires that a representative sample of the target population aid in generating the 
content pool from which the selection of items for the measuring instrument is 
made. The second phase involves administering a large sample of the items in the 
content pool to a representative group of subjects so that reduction techniques can 
identify the subset of items most appropriate for a final instrument. The third 
phase involves administering the final instrument to a new sample of subjects. 


In this study the first phase involved interviews with 156 secondary students. 
The interview questions asked the students to describe their opinions about people 
who use and like computers, about difficult aspects of learning to use computers, 
and about ways computers can be used in school and at home. Students also 
responded to sentence completion items and to a semantic differential instrument 
whose purpose was to see if the students had similar interpretations of the concepts 
“computer,” “microcomputer,” and “home computer,” but discriminated these 
concepts from that of “video recorder.” The semantic differential analysis showed 
the students to cluster the three computer terms closely together, but to 
distinguish this cluster from videotape recorder. This supports the validity of the 
interchangeable use of the words “computer,” “microcomputer,” and “home 
computer” throughout the subsequent phases of the study. 


Verbatim statements representing typical clusters of comments made by the 
students during these interviews were identified. These were developed using a 
Likert-type response format and administered to an additional 40 students. Some 
items were replaced or rewritten, based on analysis of these data, and a 
Preliminary Instrument consisting of 87 statements was prepared. By using these 
procedures, the content validity of the survey items, as representatives of 
secondary school students’ attitudes toward computers was supported. It was not 
the researchers’ opinions about what students thought, but the students’ own 
words which formed the preliminary survey instrument. 

Phase 2 of the study involved the administration of the Preliminary Instrument 
to a different sample of 887 Grade 8 and Grade 12 students. The data were 
analyzed using both factor analysis and discriminant function analysis, and the set 
of 24 computer attitude items to be used in the final survey instrument was 
selected. Full details of the procedure used in phases 1 and 2 (the instrument 
development and validation) may be obtained from the first author on request 
(Collis, 1984). 

Phase 3 involved the final survey administration. The final survey instrument 
for this study consisted of 29 attitude items: the 24 computer attitude items (still 
in students’ own words) and the five writing attitude items. Each item was 
accompanied by provision for Likert-type scoring, where “1” again represents 
“strongly agree” and “5” represents “strongly disagree.” Negatively worded items 
were recoded before data analysis, so that a low score is consistently associated 
with a positive attitude. 


Results 
Items on the final survey along with mean scores and standard deviations of 
student responses are given in Table 3. The statements relating to attitudes toward 


writing are marked with asterisks. 
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TABLE 3 


Means and Standard Deviations, Final Survey Instrument, Attitudes Toward 
Computers and Other School Subjects 


Item 


Grade 8 
Male 
(n=539) 


1. I think that a home computer can be very 
interesting 

2. People managed before without 
computers, so computers are not really 
necessary now. (R) 


3. I would like to learn how to use a computer. 


*4. When I hand in an essay, I feel I’m going to 
do poorly. (R) 

5. People who like computers are often not 
very sociable. (R) 

6. Computers are exciting. 

7. Iwould not expect a good athlete to like 
computers. (R) 

8. Computers will never interest me. (R) 

*9. [look forward to writing down my ideas. 

10. I would be embarrassed to tell my friends 
that I would like to join a computer club. 
(R) 

11. Ifyou like science you will like computers. 
(R) 

12. The world would be better off if computers 
were never invented. (R) 

13. Working with computers is not my idea of 
fun. (R) 

14. Typing would be the biggest problem I 
would have in learning to use a home 
computer. 

15. Computers do not interest me. (R) 

16. You have to be smart to like computers. 
(R) 

*17. I feel confident in my ability to clearly 
express my ideas in writing. 

18. Computers are fun. 

19. Microcomputers are easy to use. 

20. Ifmy family had a home computer, I would 
probably use it more than anyone else. 

21. Females have as much ability as males 
when learning to use a computer. 

*22. Isometimes write stories at home even if 
they are not assigned for school. 

23. Iam concerned that people might make 
computers too powerful in the future. (R) 

24. Iwould rather spend an evening doing 
something new with a computer than go 
out with my friends. 

25. Ienjoy working with computers. 


302 


1.702(0.94)> 
2.38(1.14) 
1.76(1.02) 
3.01(1.16) 
2.49(1.21) 


1.95(1.05) 
2.37(1.19) 


1.73(1.06) 
2.90(1.09) 
2.15(1.21) 
2.62(1.06) 
2.00(1.15) 
2.18(1.17) 
3.12(1.34) 
1.98(1.19) 
1.95(1.00) 
2.67(1.03) 
1.74(0.98) 
2.81(0.99) 
2.08(1.19) 
2.14(1.24) 
3.74(1.27) 


3.19(1.25) 


3.31(1.31) 


1.94(1.00) 


Grade 8 
Female 
(n=479) 
1.87(0.84) 
2.79(1.09) 
1.90(0.96) 
2.91(1.18) 


2.40(1.19) 


2.33(1.08) 
2.33(1.09) 


1.92(1.00) 
2.73(1.06) 
2.13(1.09) 
2.38(0.98) 
2.19(1.04) 
2.36(1.14) 
3.63(1.20) 
2.22(1.16) 
1.84(0.97) 
2.56(1.00) 
2.04(1.02) 
2.93(0.71) 
2.48(1.28) 
1.44(0.91) 
3.05(1.34) 


3.670(1.11) 


3.86(1.14) 


2.26(1.02) 


Grade 12 
Male 
(n=419) 
1.85(0.80) 
2.40(1.25) 
1.96(0.99) 
2.79(1.18) 


2.70(1.19) 


2.51(1.07) 
2.31(1.10) 


1.92(1.02) 
2.83(1.05) 
2.32(1.17) 
2.59(1.09) 
2.08(1.18) 
2.61(1.20) 
3.18(1.36) 
2.24(1.19) 
2.06(1.04) 
2.43(1.02) 
2.21(0.98) 
2.80(0.90) 
2.38(1.28) 
1.95(1.15) 
3.84(1.28) 


3.35(1.36) 


4.14(1.05) 


2.44(1.07) 


Grade 12 
Female 
(n=881) 
1.87(0.77) 
2.64(1.12) 
1.91(0.89) 
2.54(1.08) 


2.35(1.08) 


2.63(1.01) 
2.07(0.97) 


1.96(0.96) 
2.50(1.04) 
2.05(0.99) 
2.51(1.04) 
2.19(1.08) 
2.71(1.16) 
4,05(1.18) 
2.37(1.22) 
1.97(0.94) 
2.29(1.00) 
2.28(0.87) 
2.99(0.57) 
2.83(1.22) 
1.26(0.66) 
3.34(1.38) 


3.88(1.07) 


4.29(0.88) 


2.65(0.94) 


Attitudes Toward Writing and Toward Computers 


Item Grade 8 Grade 8 Grade 12 Grade 12 
Male Female Male Female 
(n=539) (n=479) (n=419) (n=381) 
26. Using a computer in math class would 1.73(1.04) 1.79(1.04) 2.15(1.07) 2.20(0.99) 
make math more fun. 

27. It would be hard for me to learn how to 2.53(1.24) 2.61(1.14) 2.51(1.14) 2.73(1.10) 
program a computer. (R) 

28. Computers are boring. (R) 1.83(1.15) 2.03(1.15) 2.23(1.15) 2.24(1.03) 


*29. I donot enjoy writing stories or essays. (R) 3.18(1.36) 2.62(1.29) 3.08(1.37) 2.57(1.31) 
eee 


Notes: a. The lower the mean score, the more positive the attitude. 
1=most positive, 5=least positive. 
b. Standard Deviations are given in parentheses. 
c. An asterisk indicates a writing-attitude item. 
d. (R) indicates the item was recoded, so that strongly disagree=1. 


An inspection of the means in Table 3 indicates that within each grade, males 
tend to be more positive than females with respect to computer-related attitudes 
while females tend to be more positive than males with respect to writing-related 
attitudes. When so many items are involved, it is not appropriate to analyze the 
data using individual t-tests because of the inflation of experiment-wise error rate 
involved in repeated univariate tests (Hummel & Sligo, 1971). Instead, the items 
were considered in two sets for statistical analysis. Discriminant function analysis 
was used, first for the set of 24 computer attitude items, and second for the set of 
five writing attitude items. In each analysis, the variables were used as predictors 
of membership in the four’ sex/grade groups—Grade 8 males, Grade 8 females, 
Grade 12 males, and Grade 12 females—and an a priori hypothesis of significant 
discrimination in the overall multivariate space was specified. Table 4 summarizes 
the two discriminant function analyses. 

Inspection of the group centroids indicates that sex differences produce the 
strongest discrimination in the attitudes of secondary school students toward both 
computers and writing. This result, combined with the mean scores, indicates that 
although secondary school females are less positive and confident than their male 
peers with regard to computer use (see items 1, 2, 6, 8, 12, 13, 14, 15, 18, 19, 20, 28, 
25, 26, and 27), these same females are more positive than their male peers with 
respect to writing. The writing attitude results replicate those obtained during the 
earlier administration of the 30-item writing attitude instrument in which Grade 9 
females indicated more positive responses than did males for 28 of the 30 attitude 
items; and Grade 12 females, for 23 of the 30 items (see Table 2). 

In addition, the discriminant analyses show age differences can be used to 
interpret the locations of the group centroids relative to the second axis in each 
multivariate space. An examination of the mean scores relative to age shows that 
the Grade 8 students were more positive about computers than were the Grade 12 
students, but less positive about writing than the Grade 12 students. 
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TABLE 4 


Results of Discriminate Function Analysis, Four Groups, 
Attitudes Toward Writing and Toward Computers 


Computers Writing 


Dimensions Wilks’ P Wilks’ P 
lambda lambda 

1 through 3 0.615 0.000 0.894 0.000 

2 through 3 0.853 0.000 0.961 0.000 

3 0.972 0.000 0.998 0.363 


Discriminate Functions Evaluated at Group Centroids 


1 2 3 


Computers Writing Computers Writing Computers Writing 


Grade 8 
Males —=O:774 0.176 =(Q135 0.196 0.143 —0.041 
Females 0.238 —0.401 — 0.434 0.104 —0.194 0.026 
Grade 12 
Males =O) 144 0.310 0.596 =sOO0 1 —0.145 0.058 
Females 0.954 — (0.095 0.081 (0) 38k0) 0.201 — 0.038 


Discussion 

This study supplies data to support the observations of Lockheed and others 
(Chen, 1985; Lockheed, 1985; Lockheed & Frakt, 1984) that secondary school 
females are less positive about computers than their male classmates. The results 
also support previous findings (Daly & Wilson, 1983; Price & Graves, 1980) that 
females are more positive than their male peers about writing. The study extends 
these two areas of previous research by comparing attitudes toward computer- 
related topics and toward writing within the same sample of secondary school 
students. The overall tendency of secondary school females, either in Grade 8 or in 
Grade 12, to be more negative than their male classmates about computer 
experiences suggests that efforts should be made to find approaches to school 
computer activities that are more attractive to females than those which are 
currently provided. Because the same girls who are more negative than their male 
classmates about computers are more positive than their male classmates about 
writing, Lockheed and Frakt’s (1984) suggestion that word processing might be a 
way to increase computer motivation for female students seems to be supported. 


This study does not specifically support the assumption that a transfer of 
positive attitudes from one experience to another will occur, or in particular, that 
the general positive affect that surrounds the writing situation for females can 
promote a reorientation of their perceptions of themselves as computer users. 
These questions still need investigation. However, the suggestion that writing be 
used as a major computer experience activity for secondary school females does 
seem better supported because of this investigation. 
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Given the findings of this study, how can educators proceed if they wish to 
associate computer experiences with writing in the secondary school? Word 
processing, or text editing in the writing class, is the most obvious suggestion, and 
many language theorists have written extensively on the positive impact that 
computer-aided writing can have on the development of the writing process 
(Daiute, 1985; Wood, 1985). But, in addition, a new perspective on computer 
literacy could be developed. This approach would emphasize the use of the 
computer for communication and information retrieval. Programming could be 
analyzed as a form of communication and its similarities and dissimilarities to 
natural language discussed. Information organization and retrieval using student- 
created or public data bases could be looked upon as a language experience. This 
awareness of the commonalities underlying computer and written literacy could 
bring new insights into the overall nature of communication and literacy (Evans & 
Collis, in press). 

Another benefit of a computer literacy approach which emphasizes writing and 
communication is also suggested by the sex difference data in this study. Males 
were relatively negative about writing compared with females but positive about 
computer use. Perhaps using a computer for writing will have a positive effect on 
attitudes for males as well—if females’ attitudes toward writing can transfer to 
their computer-related attitudes, males’ positive orientation toward computers 
might similarly transfer in a way which will improve their attitudes toward writing. 


Research does show that “student attitudes play a major role in determining 
performance [and that] students are found to perform close to their own estimates 
of their ability” (Duke, 1985, p. 673). Embedding one experience in which students 
feel relatively low in interest and self-confidence (computer work for females, 
writing for males) within another in which they feel more interested and more 
competent (writing for females, computer work for males) seems to be a valuable 
strategy. 
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The Impact of Teachers’ 
Work Life on Personal Life: 
A Qualitative Analysis 


The studies reported in this article examined teachers’ perspectives on the impact of 
work life factors on their personal lives. Unstructured and structured interviews, 
open-ended questionnaires, and observations were used to collect data from teachers. 
Study data were analyzed according to qualitative methods for grounded theory re- 
search. These analyses permitted the development of categories which describe the 
negative and positive effects of teachers’ work life on personal life. Data from the 
present study are discussed primarily in terms of Coser’s (1974) greedy institution 
notion. 


Although some progress has been made in understanding the meanings teachers de- 
rive from their work, relatively little research has focused on the relationship be- 
tween the teachers’ work life and personal life. This is especially striking because 
studies have repeatedly suggested that the two life domains may interact in signifi- 
cant ways (Waller, 1932; Dreeben, 1970; Lortie, 1975). 

Thus far, only a handful of studies of teachers have explored dimensions of 
work life and personal life interaction. Burden (1982) investigated professional and 
personal life interactions using interview data gathered from 15 teachers. The im- 
pact of professional life stresses on personal life were discussed in terms of role 
conflict. Blumberg, Kleinke, Pajak, Blase, Mehlenbacher, and Mehlenbacher (1982) 
studied several hundred teachers on Friday afternoons in a barroom setting. Their 
data suggested that certain structural and functional features of schools influenced 
the meanings teachers attributed to the barroom setting. Among other things, the 
barroom provided opportunities for cathartic release, camaraderie, warm informal 


Dr. Blase is an associate professor of educational administration whose research focuses on teachers as workers. 
Dr. Pajak is an associate professor of curriculum and supervision with research interests related to the roles of teachers and cen- 
tral office instructional supervisors. 
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relations, and territorial ownership, factors which contrasted sharply with how 
teachers experienced the school setting. Lowther, Gill, Coppard, and Tank (1983) 
examined the relationship between teacher work experiences and wider life experi- 
ences. Teachers’ work satisfaction was found to be an important determinant of life 
satisfaction. Zimmerman, Skinner, and Birner (1980) focused on the relationship 
between job strain, job satisfaction; and marital satisfaction for home economics 
teachers and their spouses. A significant positive relationship between job satisfac- 
tion and marital satisfaction was identified. And finally, Hange (1982), in a study 
of adult developmental issues of teachers in their fifth year of teaching found, for 
example, that occupational and personal life demands (e.g., taking care of children) 
were in conflict. 

Clearly, research addressing the work life-personal life interface is quite limited. 
The available research has relied on small samples, and, in most cases, has been de- 
fined narrowly in terms of specific work life-personal life problems (e.g., stress, sat- 
isfaction). It is also apparent that much of the existent work in this area has 
employed relatively simplistic concepts (e.g., role conflict) and hypotheses (e.g., 
spillover). The use of such concepts has failed to produce detailed qualitative de- 
scriptions of the complex interactions which occur between the teacher’s profes- 
sional and personal life domains. 

Given the theoretical and practical importance of understanding teacher work 
life and personal life interactions, and the paucity of data regarding personal 
meanings attributed to such interactions (Kando & Summers, 1971), the studies 
which serve as the data base for the present article were designed to explore these 
interactions comprehensively. Because of space limitations, this article focuses only 
on how teacher work life factors were perceived to affect teachers’ personal lives. 
The relationship between personal life factors and their impact on work life are de- 
scribed elsewhere (Blase & Pajak, 1985). Generally, the data suggest that work 
demands—that is, similarity of professional life and personal life responsibilities, 
restrictive community norms, low occupational status, and poor salary—have a sig- 
nificant limiting effect on teachers’ personal lives. In view of these findings, the 
study data are discussed primarily in terms of Coser’s (1974) “greedy institution” 
notion and Gouldner’s (1966) concept of reciprocation. 


Research Problem and Procedures 
As noted, data for this article were drawn from two qualitative studies of teachers. 
The first, a comprehensive case study conducted over a two-year period (1983- 
1985) was designed to explore interactions between teachers’ work lives and per- 
sonal lives. The study sample consisted of 80 full-time teachers in a comprehensive 
biracial urban high school in the southeastern United States. This high school in- 
cluded 1,535 students, three counselors, one principal, and two assistant principals. 
The school represented diversity within both its faculty and student population. 
Such diversity is important to grounded theory research methods which emphasize 
theory development (Glaser, 1978; Glaser & Strauss, 1967). 


Within the research setting, attempts were made to maximize variation among 
teachers. For example, teachers were asked to review a list of faculty initially se- 
lected for participation in the research project to determine if the range of perspec- 
tives related to the problem under investigation, had, in fact, been assessed. 
Additional teachers were included in the final interview sample. Although observa- 
tions and open-ended interviews were used to collect data from all teachers at the 
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school setting, in-depth (tape-recorded) interviews with 40 teachers were used to 
collect the bulk of the data. 


In generating data for the present article, teachers were specifically asked to: 
(a) identify and describe work factors (e.g., events, responsibilities, policies, prob- 
lems, individuals) which affected their personal lives, and (b) discuss the impact of 
these factors on their personal lives. Approximately 50% of the research data 
(total= 2,500 transcribed pages) were pertinent to the present analysis. 


A second study was designed to collect additional data on the research problem 
from a wider range of teachers. In grounded theory research, this procedure is 
utilized to increase the probability of generating new categories and elements of 
categories identified from previous research (Glaser, 1978; Glaser & Strauss, 1967; 
LeCompte & Goetz, 1982). This procedure allowed the researchers to confirm/ 
disconfirm categories and themes associated with the original case study. 


Specifically, the Teacher Personal-Professional Life Inventory (TPPLI), an 
open-ended instrument, was developed by the authors to collect data regarding the 
teachers’ perspectives on the effect of work factors on personal life. According to 
Allport (1942), a questionnaire can be a useful personal document in qualitative re- 
search addressing subjective perceptions of persons. “Any self-revealing record that 
intentionally or unintentionally yields information regarding the structure, dynam- 
ics and functioning of the author’s life” (p. xii) can be considered a personal docu- 
ment. A questionnaire is defined as a personal document when the research 
participant controls the wording of the answers. 


The TPPLI was constructed to collect interpretive data from teachers in a 
manner which encouraged free expression of personal meanings associated with the 
relationship between their work and personal lives. Two versions of the instrument 
were administered to different samples of public school teachers. To permit 
variation of responses (Lofland, 1971), the first version only asked teachers to dis- 
cuss the impact of work factors on their personal lives. Because this instrument re- 
quired about one hour to complete, it was administered to teachers (n=55) taking 
coursework at major universities in lowa and Georgia. Examination of these data 
indicated that a somewhat more structured version of the instrument could organ- 
ize the research participants’ responses without undue control. In essence, the sec- 
ond version of the TPPLI was modified to reflect only general categories analyzed 
from data collected via the first administration of the instrument. 

Glaser and Strauss (1967), Glaser (1978), and Bogdan and Taylor (1975) em- 
phasize that when the generation of substantive categories, hypotheses, and theory 
(vs. verification) are the research goals, data should be solicited from a range of 
groups to develop categories and their properties. The TPPLI was administered to 
48 male and 97 female teachers: elementary, middle, junior, and high school. Again, 
given the amount of time required to complete the instrument (approximately 1 
hour 15 minutes), the study sample was drawn from teachers enrolled in a range of 
university courses in education at three major universities in northeastern, 
southeastern, and midwestern regions of the United States during 1984 and 1985. 

On the TPPLI, teachers were directed to identify and describe two work factors 
which importantly affected their personal lives. Teachers were also asked to de- 
scribe feelings associated with each factor. Additionally, teachers described the im- 
pact of work factors on family, friends, and community. This procedure generated a 
total of 271 pages of detailed descriptive data. These data were analyzed by two re- 
searchers independently. The researchers met to discuss questions and issues 
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related to the analysis of categories. An independent assessment of the researchers 
analyses yielded 93% agreement. 


It should be emphasized that the data presented in this article were produced 
by qualitative procedures rooted in assumptions, values, and goals which differ 
from those associated with quantitative enquiry. In qualitative analysis, the em- 
phasis is on the discovery of new knowledge and the development of data-based 
categories and ideas. The goal of such research is to develop categories (and their 
properties) from relatively unstructured data. Clearly, the ideas discussed in this 
article cannot be generalized to all teachers in a statistical sense. However, the ex- 
ploratory studies reported are grounded in the notion that understanding the per- 
spectives of the teachers studied could contribute to basic knowledge about 
teachers and the meanings they associate with the impact of work factors on the 
personal life domain. Further, it is contended that the categories and theoretical 
ideas described in this article are most likely basic elements of the problem investi- 
gated. These ideas are considered the means of generalization in qualitative re- 
search which focuses on the generation of categories and theory (Hanson, 1981). 


Overview of Study Data 

The research data pointed out that such factors as excessive work demands, com- 
munity, salary, university coursework, job status, and colleagues were defined by 
teachers in terms of their negative effect on personal life. Dimensions of these and 
other factors, such as students, opportunities to enhance personal interests, com- 
munity, colleagues, university coursework, vacations, and salary were perceived to 
have a positive impact on the teachers’ personal lives. Analyses of the study data 
indicated that teachers perceived negative work life factors as exploiting their per- 
sonal life domain. The categories below describe the meanings teachers themselves 
attributed to work life factors which affect their personal lives. The description 
begins with the most frequently reported categories for negative and positive ef- 
fects respectively. 


Results: The Negative Effects of Work-Life Factors on Personal Life 
Table 1 summarizes the negative effects of work life factors on teachers’ personal 
lives. 


Excessive Work Demands 
The data indicate that teachers defined work demands as excessive. Teachers 
discussed “always being busy,” and having “never a moment to regroup” while on 
the job. Many elementary teachers pointed out that they had no planning periods. 
Secondary teachers suggested that planning periods were often “used up” for meet- 
ings and conferences. In addition, school-related work, for example, meetings, par- 
ents’ conferences, clubs, coaching, and public performances added substantially to 
the work day. “Take-home work” (e.g., lesson preparation, receiving calls from par- 
ents, calling parents, grading, typing, handouts) extended work time further into 
the evening. Demands of this nature adversely affected their personal lives. Accord- 
ing to the data, teachers spent about 15 hours per week in school-related work at 
home. Teachers experienced work as physically and emotionally “draining.” This 
meant “neglected” spouses and children. 

Even though I enjoy teaching, my own children suffer. After I have taught other people’s 

children all day long, I do not give my own children the patience or time they deserve. It 
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is hard to run a home and a classroom. On Mondays I feel I have the world under control 
. .. Wednesdays I begin to get behind . . . and by Thursdays, I really don’t care. 
Everything has to wait to the weekends. 

The data point out that teachers also ignored personal health (e.g., prepared 
quick meals, failed to exercise) and personal interests (e.g., hobbies, entertain- 
ment). A few teachers divulged that their “sex lives had been ruined” because of ex- 
cessive job demands. Teachers reported that unrealistic job demands created 
adverse moods and significantly contributed to family conflict. The data also show 
that teachers required “supportive families,” that is, families that are “understand- 
ing,” “tolerant,” “helpful” with the ability to “take care of themselves” in order to 
deal with work stresses related to inordinate demands. 


Take-home work reduced or altogether eliminated time with friends. Many 
teachers became so time-conscious that, as one stated, “I resented my friends for 
even calling on the phone.” In spending time with friends, several indicated they 
“felt guilty for not working.” In the teachers’ view, friends perceived them as 
“crazy” and “workaholics.” Six single teachers indicated that take-home work, “had 
a devastating effect” on their personal lives. Other professional friends were viewed 
as having more active social lives. A few teachers reported “[having] friends who 
understand and respect me for what I do.” The data show that teachers restricted 
or eliminated involvement in community, (e.g., church activities, community serv- 
ices, and professional organizations) because of work demands. 


Several teachers believed that lack of opportunities to “rest and relax” led to 
chronic fatigue (“feeling constantly drained”) and adversely affected work perform- 
ance. Such feelings were associated with experiencing work as “day-to-day 
survival.” 


Teachers indicated that they felt anger, frustration, resentment, stress, panicky, 
cheated, used, drained, guilty, failure, burned out, rushed, exhausted, over-ex- 
tended, bitter, anxious, disheartened, pressure, fatalistic, furious, trapped, impo- 
tent, and on the verge of collapse, as a result of the workload. 


Excessive Work Demands: Emotional Overload 
While emotional overload was a problem for most teachers represented in the gen- 
eral category of excessive work demands, some teachers, for whom this was an 
issue, focused specifically on this aspect of the problem and its implications for 
their personal lives. That is, “worries,” “stress,” “tensions,” and “problems” associ- 
ated primarily with student discipline, and helping students reach goals proved to 
be “emotionally and spiritually” taxing: 
I teach in a high school. I teach lower levels and mostly all black. The school is about 85% 
black. It’s a different culture and set of values. Many are drop outs who have returned. 
They are in trouble with the law, etc. The days are tense and riddled with incidents—one 
confrontation after another over large things and trivial ones. The tension mounts until 
3:30. When I come home, I don’t want to hear another word from anyone or give another 
order. I don’t want to hear the TV or cook supper. I just want to go to sleep and forget it 
all or maybe watch my aquarium and blot out that day’s events. 

Quite frequently, job tensions precipitated negative mood states which caused 
teachers to “neglect” spouses and children. Conflict with family members and “feel- 
ings of alienation” were related to negative job moods as well. The data point out 
that teachers became “ill tempered,” “less patient,” and “less tolerant” with family 
members: 
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I’m not as caring and nurturing with those I am closest to as I would like to be. I often feel 
I am better with strangers. 

Examination of the findings indicated that teachers’ moods had no direct im- 
pact on friendships (note: teachers reported rarely socializing with friends). 
Negative moods were however related to decreased community participation. 
Feelings associated with negative mood states were similar to those previously de- 
scribed for this category. 


Impact of Community 
Teachers expressed concern about living and working in small communities in 
which their personal lives were “under the watchful eye of Big Brother.” Teachers 
resented “being on stage,” “in a fishbowl,” and subject to the “constant scrutiny” of 
parents and students. Teachers explained that strict prohibitive norms which 
governed activities such as drinking, dancing, and casual dress deprived them of 
both privacy, anonymity, and opportunities for self-expression: 
I have lived and taught in a small rural community. The members of the community have 
definite ideas on how teachers should behave even on their own time and those ideas are 
very conservative. If we went out to dinner at a local restaurant, I would think twice 
about having a drink since the parent, brother or sister of my students would most likely 
serve it and certainly others would be watching. If we went out on the weekend, or went 
shopping, or went to the post office, we ran into my students and their families. I always 
had to be on my best behavior and never could “let my hair down.” I felt as if I was on dis- 
play or under examination most of the time. I felt that I was a teacher but not a real live 
person lots of times. 

More broadly, teachers resented community expectations for “exemplary role 
models,” particularly in the context of what was perceived as community hypocrisy. 
Others objected to “being put into the position of experts on every matter” which 
usually required extra work and responsibility. 


Several teachers explained that, because of conservative community norms, en- 
tertainment was sought out of town (half of the teachers reported no effect on fam- 
ily). Close community scrutiny also restricted the kinds of activities that teachers 
engaged in with friends. Many teachers, the data show, chose friends who could 
“commiserate” with their feelings. It should be noted that concerns about commu- 
nity were reported for suburban, urban, and rural teachers. 

With regard to the problem of public scrutiny, teachers expressed feelings of re- 
sentment, anger, and alienation. Others reported guilt and confusion because of the 
conflict between “what I think I should do and what I want to do.” Feelings of 
being “cheated,” “limited,” and restricted were described. 


Salary 
Teachers indicated that salaries were significantly inadequate. Although most 


teachers reported having working spouses, “living from paycheck to paycheck” was 
a common concern. Personal and family interests, entertainment, vacations, and 
recreation were severely restricted. One teacher remarked, “I only entertain twice 
each year... This is a burden.” Several indicated that the salary level meant living 
in “unattractive neighborhoods with unprofessional people.” In addition, financial 
problems were viewed as straining family life. Three males reported that they had 
been accused of “not being good providers” by their wives. Expenditures for the 
children were monitored closely. For single teachers along these lines, concerns 
were no less prevalent: “It means sharing some type of hovel with two or three 


roommates.” 
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Teachers explained that salary levels demanded that they moonlight. This ex- 
panded the work day, reduced preparation time, and deprived teachers of time 
with their families. Several teachers expressed concern about future retirement. 
Many “regretted” entering the profession and indicated that they would leave when 
the opportunity arose. Teachers for whom salary was an issue explained that finan- 
cial strains limited involvement with friends. Simply put, they could not afford 
many of the “functions and activities” in which friends were involved. (Several indi- 
cated no impact here.) Low salary negatively affected teachers’ perceptions of com- 
munity. Salaries were defined as reflecting lack of community respect and 
appreciation. For some, this adversely affected self-concept. Several teachers 
pointed out that the low salary structure “alienated” them from community in- 
volvement. 


Feelings were described as follows: anger, frustration, alienation, hopelessness, 
low self-esteem, depression, guilt, envy, futility; being used, unappreciated, ex- 
ploited, resentful, and regretful. 


Coursework 
According to the data, teachers perceived “great pressure” from school systems to 
continue coursework toward advanced degrees. However, “time, the cost of attend- 
ing, travel expenses, books, preparation, tests, research time” detracted from the 
quality of their personal lives. Summers were spent at the university. During the 
school year, little time was available for family: 
Some weekends when my wife and I are together, I end up spending most of my time at 
the library working. I feel guilty because our time together is so limited. 
Teachers indicated that coursework interfered substantially with time for socializ- 
ing with friends and involvement in community. With reference to the latter: 
I have reduced membership in community organizations I enjoy being in. I don’t have as 
much interaction with my neighbors as I once did. I especially regret the loss of active in- 
volvement in my local political party activities. 
In addition to anger and frustration, teachers expressed feelings of fatigue, 
guilt, pressure, being swamped, and envy (“of those with jobs which allow free 
time”). 


Job Status: In View of Society 
This category was defined more generally than community. Here, teachers 
suggested serious concern about the low status of their profession—the relative 
rank of the profession as compared to others—in society’s view. The significance of 
being a teacher in a society which “lacked appreciation” and “understanding” were 
strong in the data. Above all, teachers emphasized the public’s lack of respect and 
preoccupation with criticism: 
With all the national publicity about our nation’s poor educational system, sometimes I 
have to think twice about why I went into such a low-paying, low self-esteem job. It’s 
really depressing when you meet someone and you tell them that you’re a teacher and 
they look at you like “Oh, you’re just a teacher. You’re a nobody.” It’s also depressing to 
have to take the blame for things that are not your fault—such as the educational system 
is so bad because we have such dumb, unqualified teachers! 
Interestingly, the data show both defensiveness and acquiescence on the 
teachers’ part in response to public criticism and low occupational status. Teachers 
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indicated no important effect of the status issue on family. For some, status pro- 
duced conflict with friends. A few felt they were “not taken seriously” by the com- 
munity because they were teachers. 


Feelings such as guilt, resentment, anger, defensiveness, injustice, and inade- 
quacy were reported in connection with this category. 


Colleagues 
For some teachers, disagreements, conflicts, gossip, backbiting, rude behavior, and 
criticism with colleagues influenced their personal lives: 
A particular teacher is highly critical . . . she whispers side remarks . . . while she is 
standing within hearing distance of a group of faculty members. All of her devious ways 
and vicious talk is the very center of my personal life as I have been unable to cope with 
the teacher hostility and not bring all the built up hostility home to my personal life. 
Teachers indicated that negative feelings carried over into family life. Notwith- 
standing, families were usually supportive. Teachers perceived friends to be 
supportive as well. Impact on the teacher-community relationship was not evident 
in the data. 


Feelings of anger, hostility, and guilt were expressed with regard to the behav- 
ior of colleagues. 


Absence of Social Contacts 

Two single teachers reported that being a teacher meant a limited social life. 

Teaching failed to make opportunities available to meet others: 
I have remained single thus far and now I am finding that I want a family of my own. I 
think that the level of commitment that I’ve made in the past to work and school pre- 
cluded opportunities to focus enough energy on a social life. As I near completion of my 
degree, I find myself looking with envy upon mothers with very young children. 

Friends of these two teachers were colleagues; they were empathetic and 

supportive. The data indicated no affect on the teacher community relationship. 


Feelings of regret and sadness were suggested in the data. 


Results: The Positive Effect of Work Life Factors on Personal Life 
Table 2 summarizes the positive effects of work life factors on teachers’ personal 
lives. 


Students: The Development of Interpersonal Competencies 
The research data suggested that as a result of long term interactions with stu- 
dents, teachers develop knowledge of students’ academic, social, and personal 
needs and problems. This category was specifically defined in terms of “becoming 
more human,” themes related to the development of teacher “empathy,” “toler- 
ance,” “patience,” and “human sensitivity.” In essence, interpersonal values (e.g., 
importance of student needs), skills, (e.g., listening) and predispositions (e.g., 
willingness to be supportive) were produced: 
My 6th grade student died of brain cancer. I was profoundly affected by Stacy’s dying. It 
was hard to understand how an 11-year-old could waste away, why an 11-year-old should 
suffer and die. ... Stacy’s death made me stop and see my students more individually. ... 
Because of Stacy, I am not afraid to tell my students I love them, or that I’m afraid some- 
times and I get frustrated and unhappy. . . . From Stacy I learned that I can make a 
difference in others’ lives by sharing mine. 
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Most of the teachers whose comments were relevant to this category suggest that 
knowledge of students helped them to understand and provide support to their 
own children. Teachers became more empathetic, supportive, communicative, and 
less judgmental with their own children: 


My knowledge of child development and differing learning modes has helped me to be a 
better and more understanding parent. Since I have two children, knowing that children 
learn in different ways has helped me not to expect or demand the same behavior per- 
formance and achievement from both . . . Most of the time my children benefit from my 
patience and understanding of various phases the children are passing through. 
A few teachers stated that relations with friends benefited from interpersonal com- 
petencies learned at school. No important affect on teacher-community interaction 
was noted in the data. Major feelings expressed with regard to the development of 
interpersonal competencies were satisfaction and contentment. 


Students: Teacher Self-Esteem 
Teachers suggested that successfully influencing the academic and social lives of 
students impacted positively on their self-esteem and confidence: 
Although at times I feel very pressured by all my commitments to family and students, I 
feel very good about the fact that I am able to make a difference in someone else’s life. I 
have been out of teaching for a number of years (rearing a family) and I know the empty 
feeling from not being involved in the lives of others (other than family members). I like 
the feeling of being a person of my own... not just someone’s spouse or mother. 
Increased work satisfaction and self-esteem derived from helping students 
translated into “more harmonious” relationships with family and friends. Several 
teachers commented that the attitudes and skills they acquired from helping stu- 
dents enhanced their involvement with the community (e.g., work with social serv- 
ice agencies). 
Feelings of respect, love, warmth, satisfaction, happiness, pride, and pleasure 
were related to helping influence students. 


The Job: Enhancement of Personal Interests 
The study data show that teachers develop interests, skills, and attitudes from the 
job which transferred directly to their personal lives. Subject matter interests (e.g., 
music, photography, health, fitness), competencies related to organizing and time 
management, and positive attitudes, for example, increased self-esteem, were men- 
tioned most frequently. Teachers believed that the factors identified above en- 
hanced family life. The comments of one teacher are illustrative: 

I’m better organized. I accomplish more so I can be a better model to my children. I am 

teaching them to be task oriented. 
Some of these teachers suggested that job-related personal interests deprived the 
family of time and caused resentment. Friends were viewed as benefiting from the 
teachers’ growth because “they share in what I do.” In a few instances, the commu- 
nity was defined as profiting in the same way, although most respondents indicated 
no affect in this regard. 

Teachers linked feelings of satisfaction, pleasure, reassurance, pride, joy, 
success, excitement, belonging, and frustration with the enhancement of personal 


interests. 
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Community: A Source of Professional Pride 
The available data suggest that teachers derive pride from community recognition. 
For instance, the role of expert, that some teachers found onerous, was defined by 
others as reflecting community trust and respect: 
Teaching adds satisfaction and purpose and esteem to my personal life . . . It gives me 
status within my community because of the kind of work I do. 
The teacher’s family was viewed as “benefiting” from positive regard of the commu- 
nity. Special invitations and favors given to family members were indicative of 
community respect. Teachers claimed that in cases where communities “recognize 
the profession,” family members experienced pride. Teachers also reported that 
their relations with friends and community were affected positively. In each case, 
teachers felt “proud” when they were “approached as experts” and “authority fig- 
ures” by others. 


Feelings of “pride” were most commonly connected with community respect. 
Teachers also felt trusted and important. 


Collegial Friendships 
Teachers reported that relations with their colleagues were frequently translated 
into personal friendships based on “shared interests” and common problems. (In 
fact, most of the study sample indicated that friends were usually teachers.) As one 
teacher remarked: 
This gives me a feeling of satisfaction. It makes me feel that my job is a rewarding job be- 
cause it offers friendships. 
The data point to a positive impact on family, friends, and community, although 
the nature of this impact was unclear. 


Feelings of satisfaction, contentment, pleasure, and gratefulness were related to 
collegial friendships. 


Coursework: A Source of Knowledge and Pride 

A few teachers perceived university coursework as a source of professional knowl- 
edge and self-esteem. Although coursework reduced the time with the family, 
friends, and community, teachers suggested that they had “more to offer” as people 
because of increased knowledge. For example, being a student provided “a good 
model” for their children. 


Feelings of satisfaction, pride, confidence, commitment, and accomplishment 
were associated with coursework. 


Vacations: Time with Family 

Teachers who stressed the importance of vacations explained that little time for 
family was available during the school year and that family-related responsibilities 
were frequently ignored. Holidays and summer vacations, however, meant that 
“concentrated time” was given to family and friends. This alleviated feelings of 
guilt experienced throughout the year. Additionally, vacations meant extra income 
to some teachers through summer employment. The research evidence failed to 
clarify the impact of vacations on the teacher’s community involvement. 


Feelings of satisfaction were apparent with regard to vacations. 
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Salary: A Supplemental Income 
Three female teachers defined their salaries as “supplemental.” Consequently, 
“extra” money was available to meet family needs and pursue family interests. 
This, according to the findings, increased teachers’ personal life satisfaction and 
enhanced the overall quality of family life. The supplemental income was linked to 
housing and neighborhood fun, and increased interaction with friends. No affect on 
teacher involvement with community surfaced in the data. 

Teachers experienced increased satisfaction as a result of receiving “extra” in- 
comes. 


Summary and Discussion 

This article is based on data drawn from two studies of American teachers. While 
positive effects were noted overall, the data revealed that work factcrs tended to 
spill over and adversely affect the teachers’ personal life domain. Roughly about 
two-thirds of the data used for this analysis were related to the negative pattern 
described above. Not surprisingly, many of the factors discussed—excessive work 
demands, relationships with colleagues, salary, and job status—have been impli- 
cated in stress and burnout studies of American and British teachers (e.g., Blase, 
1986; Cox, 1977; Kyriacou & Sutcliffe, 1978; NYSUT, 1979; Pratt, 1976). 

Studies of Canadian teachers have shown that such problems as work and role 
overload (lack of adequate time) and relations with colleagues are important 
stressors. Other factors, however, salary (and moonlighting) and job status, have 
not been reported as critical to Canadian teachers (Bartell, 1984; Dubois, 1978; 
Friesen & Williams, 1985; Hiebert, 1985; Klas, Kennedy, & Kendall-Woodward, 
1983; Syrotuik & D’Arcy, 1983). Indeed, for Canadian teachers, the latter issues 
may not be problematic in term of teachers’ personal lives. Along these lines, 
Hiebert (1985) reports that while Canadian studies of teacher stress associated with 
demographic differences in the amount of stress or type of stress have demon- 
strated some consistency (e.g., stress related to insufficient time), most studies 
have failed to do so. This suggests that stressors and personal life effects may be 
specific to particular samples of teachers. 

At a more abstract level, examination of the present study data indicate that 
the limiting effect of work factors on personal life occurs in a number of ways: 
First, overload factors related to the amount and difficulty of work appear to 
“drain” teachers physically and emotionally (Marks, 1977). According to the data, 
this directly limits teachers’ time and energy for family, friends, and community. 
Work overload factors also contribute to the level of conflict in and disruption of a 
nurturing family structure. The limiting effect on the teachers’ work life is further 
exacerbated by the similar requirements of the parental and teacher roles which 
seem to compound the problem of “getting away from the job” (compartmentaliza- 
tion). Thus, for teachers, effective disengagement and opportunities for recupera- 
tion tend to be particularly difficult. Second, salary level and corollary problems 
such as moonlighting limited teacher and family pursuance of personal interests. 
Intellectual aesthetic, recreational, and other activities were notably circumscribed 
because of low salary levels. 

Third, the omnipresent moral expectations of the community deprived teachers 
of anonymity and made difficult involvement in and enjoyment of certain conven- 
tional social activities. This further minimized involvements outside of work. In 
many instances, unreasonable expectations were reported to have actually isolated 
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teachers from the communities in which they lived. Fourth, low occupational status 
was reflected in lack of appreciation and understanding on the part of community 
and society. Again, this alienated teachers from their communities and negatively 
affected teacher self-esteem, motivation, and commitment to work. 


From the data described in this report, it appears that a fundamental conflict 
exists between the professional life and personal life demands of teachers. Indeed, 
it would seem that teachers often resolve this conflict in favor of professional 
demands. Functionally, time and energy commitments required by the job 
systematically preclude teacher participation in a wider range of familial, social, 
cultural, and political activities outside of the work setting. Stated bluntly, the pro- 
fessional life domain of the teacher frequently exists at the expense of the personal 
life domain. 

It is understandable that perceptual data drawn from teachers would lead to 
the foregoing conclusion. According to Gouldner (1966) social exchanges in the 
American culture tend to be viewed within a reciprocation framework. That is, ex- 
changes between individuals and others (e.g., institutions) are evaluated in terms of 
contingent benefits. When, for example, one’s inputs are perceived to exceed one’s 
outcomes (benefits), an unequal (or unjust) exchange exists. This may be referred 
to as “reciprocity imbalance.” Gouldner argues that such imbalances destabilize so- 
cial interaction and social organization. The data presented in this article (corrob- 
orated by research in other areas, such as teacher satisfaction and teacher stress) 
support the conclusion that, in the view of a significant percentage of teachers, un- 
equal exchanges and reciprocity imbalance exist when the professional life and per- 
sonal life domains interact. 


Coser’s (1974) classic discussion of “greedy” institutions captures an underlying 
element of the reciprocation problem analyzed from the data. Coser states: 
Organized groups are always faced with the problem of how best to harness human ener- 
gies to their purposes . .. Competition for loyalty and commitment is a perennial problem 
because these are scarce resources. Not only do human beings possess only finite libidinal 
energies for cathecting social objects, but their resources of time are similarly limited. As 
a consequence, various groups having a claim on individual’s energies and time compete 
with one another in the effort to draw as much as they can, within normative limits, from 
the available pool of resources. The struggle over their allocation is as much a root fact of 
social life as is the competition of users of scarce resources in economic affairs. (p. 1) 
Greedy institutions, Coser argues, make total claims on their members. They at- 
tempt to gain “undivided loyalty” and reduce the claims of competing groups. Cer- 
tain religious sects, utopian societies, political groups, and family structures are 
defined as greedy institutions. 


To be sure, the occupation of teaching has not evolved historically to require 
the level of commitment (totalistic) from its members suggested by Coser’s concept 
of greedy institution. Clearly some degree of reciprocity exists. Nevertheless, the 
greedy institution concept is useful in sensitizing us to a moral issue: That time 
and energy, and moral, intellectual, and emotional demands typically placed on 
teachers may be inequitable with regard to compensation, status, recognition, and 
other such issues identified in the present research. If it can be argued that society 
has drawn from teachers beyond that which it has fairly returned (i.e., financial, so- 
cial, and status rewards), and in fact, societal expectations work to undermine im- 
portant familial and social structures, then the occupation of teaching may indeed 
be considered a greedy institution. It is our belief that society continues in its fail- 
ure to comply with a basic principle of reciprocation from which individuals and 
groups judge interactions as just or unjust (Gouldner, 1966). 
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The purpose of this article is to consider the problem of the small school district from 
a new perspective. We first examine the policy goals likely to be held by provincial 
ministries (or state departments of education) to try to understand the past popular- 
ity of district amalgamation as a policy mechanism, what problems it is intended to 
address, and why it is thought to be the appropriate solution to these. We contend 
that the rationale for amalgamation is based on myths about the small school district, 
and in the next section we attack these myths, using longitudinal British Columbia 
data in a series of statistical analyses and some case study data and interpretations. 
Finally, we argue that the essence of the problem in British Columbia is mean school 
size, not district size, and that consequently the solution lies not in school district 
amalgamation, but in revisions to the way in which school districts are funded—the 
“fiscal framework” recently adopted by the Ministry of Education. 


In 1945 there were 650 school districts in British Columbia; in 1984 there were 75. 
Despite the great increase in the number of students during that period, the num- 
ber of districts dropped drastically, the result of several rounds of amalgamations. 
For example, an early Royal Commission report called Survey of the School 
System recommended that “consolidation of assisted schools be carried out 
wherever it seems educationally or financially desirable” (Putnam & Weir, 1925, 
p. 32). Other Royal Commission reports on education (Cameron, 1945; Chant, 
Liersch, and Walrod 1960; King, 1935) have urged a similar course of action. In the 
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early 1970s the British Columbia School Trustees’ Association commissioned five 
case studies of school district organization (Coleman, 1971; Levirs, 1971; 
Plenderleith, 1971; Robinson, 1970; Scribbs, 1971). These reports also favored con- 
solidation. This phenomenon was not limited to British Columbia. Between 1940 
and 1980, the number of students in American public school systems rose from 
25,434,000 to 41,645,000; the number of school districts fell from 117,108 to 15,912 
(National Center for Education Statistics, 1983). 

Given this history of amalgamation in British Columbia, one might expect the 
variation in district size to be limited and the number of small districts to be few. 
In fact, neither of these expectations holds, as the data on school district size pre- 
sented in Table 1 demonstrate. (These figures are based on the Statistical Report 1 
of the 1984 BCSTA analysis of school district budgets.) School districts range in 
size from about 500 to 50,000. Although the mean district size is 6,380, fully half 
the districts in the province have fewer than 3,600 students. In fact, over one 
quarter of the districts have fewer than 1,500 students, and not even one fifth of 
them have more than 10,000 students. 


No matter how one defines a “small” school district, then, there are obviously 
many in British Columbia. They continue to exist in spite of repeated waves of 
consolidation. Yet amalgamation still appears to be an attractive option to policy- 
makers. As recently as the spring of 1983 consolidation of districts was once again 
being considered as a policy option by the Sager Commission appointed to investi- 
gate the relationship between district size and operating costs as an aspect of the 
government restraint program. The Commission was dissolved, however, before re- 
porting. In our ongoing study of school district ethos, trustees and senior adminis- 
trators frequently tell us that the threat of amalgamation is always in the 
background. Certainly the frequency with which amalgamation is cited as a 
solution to a variety of problems suggests that small districts are seen as undesir- 
able. 


The purpose of this article is to consider the problem of the small school dis- 
trict from a new perspective. We first examine the policy goals likely to be held by 
provincial ministries (or state departments of education) to try to understand the 
past popularity of district amalgamation as a policy mechanism, what problems it 
is intended to address, and why it is thought to be the appropriate solution to 
these. We contend that the rationale for amalgamation is based on myths about the 
small school district, and in the next section we attack these myths, using longitu- 
dinal British Columbia data in a series of statistical analyses, and some case study 
data and interpretations. Finally, we argue that the essence of the problem is mean 
school size, not district size, and that consequently the solution lies not in school 
district amalgamation, but in revisions to the way in which school districts are 
funded, the “fiscal framework” recently adopted by the Ministry of Education. 


The Policy Goals of Amalgamation 

Mitchell and Encarnation (1984) identify three competing educational policy goals: 
efficiency, or a concern for cost effectiveness; quality, or a concern for student 
achievement; and equity, or a concern for both equality of educational opportunity 
and local control. They maintain that it is possible for a policy to facilitate the pur- 
suit of all three goals, although generally only one of the goals is dominant at a 
time, often at the expense of the other two. Amalgamation can be considered as a 
policy option from the perspective of each of these three goals. 
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TABLE 1 
School District Size in British Columbia, 1984 


a 


District Size No. of Districts 

OF 2 1500 20 

1501 - 3000 12 

3001 - 4500 10 

4501 - 6000 10 

6001 - 10000 9 
10000 - 15000 4 
15001 - 20000 4 
over 20000 4 

mean= 6380 


median = 3606 


Efficiency 
Consolidation is generally thought to contribute to efficiency through economies of 
scale, based on the assumption that as size increases, unit costs decrease. 
Proponents of amalgamation argue that such economies are realized mainly 
through lower administrative costs, lower maintenance costs, higher pupil/teacher 
ratios (and consequently lower teacher costs), and bulk purchasing. Certainly the 
reports of school district reorganization in British Columbia mentioned earlier ac- 
cepted this premise: 
The Commission considers that the consolidation of such districts would result in econo- 
mies regarding administrative and some other costs. (Chant et al., 1960, p. 58) 
There is a strong, though not completely linear, relationship between school district size 
and economy of school district operation. (Robinson, 1970, p. 23) 
There are few alternative explanations for the difference in costs between large and small 
school districts which explain the fact as satisfactorily as the view that large districts 
achieve substantial economies of scale in their educational operations. (Coleman, 1971, 
Daas) 

The argument that consolidation would result in economies of scale is so com- 
pelling that it is rarely challenged. Instead, debate has tended to center on more 
technical issues such as the ideal unit size (see, for example: Cameron, 1945; Edu- 
cational Research Service, 1974; Robinson, 1970) rather than on empirical evidence. 
Sher and Tompkins (1977) cite a number of studies which fail to find these econo- 
mies of scale, and which even find evidence of diseconomies of scale due to, for ex- 
ample, increased transportation and distribution costs. They conclude that the 
success of an amalgamation policy, at least in terms of efficiency, depends on local 
circumstances, including population density and valuation levels; savings cannot be 
assumed. 

In spite of some recent challenges to the assumption that district amalgama- 
tions result in economies of scale, it is still a powerful and well accepted policy 
mechanism. Its popularity with governments rests, of course, on the assumption 
that small school districts are inefficient, that their costs per pupil are necessarily 


higher than those of larger districts. 
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Quality 

The quality of the educational program in small districts is often discussed in 
terms of either resources available or social climate. They reflect two entirely dif- 
ferent approaches to assessing quality. 

Proponents of amalgamation favor the former. They argue that consolidation 
contributes to the quality of the educational program by providing more and better 
resources. These include more qualified and experienced teachers, better facilities 
(laboratories, shops, libraries, gyms), a wider range of courses, and student support 
services such as counseling. As a result of all these resources, the argument con- 
tinues, students learn more. 

Two Canadian studies (Dibski, Lucas, Renihan, & Skaalid, 1983; Gleadow & 
Bandy, 1980) have found that the provision of student and teacher support services 
in rural districts is often inadequate, at least from the perspective of the teachers. 
Dunne (1977) reports similar findings in the United States. While there is some 
evidence that the resource levels of small districts are not as high as those of large 
districts, particularly with respect to facilities, this does not mean that their 
schools are poorly equipped (Gleadow & Bandy, 1980) and unable to offer a com- 
prehensive program (Sher & Tompkins, 1977). 


Parents and educators in small districts value qualitative aspects of the educa- 
tional experiences: the slower pace and less pressured environment, and the closer, 
more informal ties between teachers and parents (Dunne, 1977; Gleadow & Bandy, 
1980; Rosenfeld, 1977; Sher & Tompkins, 1977); the warmer, more supportive rela- 
tionships between the teachers and the student (Dibski et al., 1983; Dunne, 1977); 
and the greater individualized attention and instruction given to the students 
(Dunne, 1977). These concerns are not weighted as highly as those pertaining to ef- 
ficiency and resource levels when amalgamation decisions are made: “the benefits 
of consolidation seemed overwhelming and the costs minimal by comparison” (Sher 
& Tompkins, 1977, p. 45) even though they are very important to the people af- 
fected by the decision. 


The view that small school districts do not provide the same quality of educa- 
tion as larger districts is less widespread than the previous myth, probably because 
achievement data are less readily available than cost data. This argument in favor 
of amalgamation is based on the myth that small school districts are ineffective, 
that they provide lower quality education than larger districts. 


Equity 

Traditionally, the goal of equity has been pursued through provisions for public 
participation in governance, particularly at the local level. Local autonomy was the 
hallmark of equity. Since the mid-1950s, however, equality of educational oppor- 
tunity has been a major policy thrust. Thus the policy goal of equity has two quite 
distinct meanings. 


Proponents of amalgamation argue that students will benefit from the ex- 
panded opportunities that amalgamation permits: the wider range of course 
offerings, the contact with a more varied group of teachers and students, the expo- 
sure to a wider range of experiences, and better preparation for the urban world. 
This argument is based on a concern for equality of educational opportunity, on a 
desire to ensure that students in small districts are offered the same kind of educa- 
tional experiences as their counterparts in large districts. 
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Opponents of amalgamation, on the other hand, do not want their children to 
be “urbanized.” They value the close ties between the school and the community 
and the responsiveness of the system to local needs (Gleadow & Bandy, 1980). As 
Dunne (1977) explains, the issue “is not so much cash as control. Rural parents are 
reluctant to delegate school policy to distant ‘experts,’ and often feel that a ‘good 
education’ is close to home, both physically and academically” (p. 85). This argu- 
ment, then, is based on a concern for local autonomy, particularly over the kind of 
educational experiences the students are offered. 


There has been a long history of conflict between these two aspects of equity in 
British Columbia, particularly when amalgamation is at issue. For example, when 
Putnam and Weir (1925) recommended consolidation of school districts in British 
Columbia, they did so largely on the basis of equality of educational opportunity. 
They urged consolidation to be carried out “with the approval of local boards if 
possible, but in the face of their disapproval if necessary” (p. 32). Similarly, the 
Cameron report (1945) asserted: “We must have more equality than we have: if we 
cannot have it without complete centralization, we shall have to have that too; if we 
can have it and still preserve local control, let us by all means do so” (p. 37). 

The conflict has been particularly prevalent since the 1970s. Local autonomy 
often meant that districts decided how much they wished to spend on education 
and, to some extent, what they spent it on. This, of course, resulted in differences 
in the educational programs offered throughout the province. Concern for equality 
of educational opportunity led to assertions that the province has a responsibility 
to ensure that the school districts, its creatures, did not contribute to inequality so 
that district of residence was associated with different levels of educational oppor- 
tunity for students. This has been accomplished through increasing centralization 
of fiscal and program decisions, thereby limiting local autonomy. 

Another aspect of the amalgamation argument, then, is that small school dis- 
tricts are inequitable, that is, they do not offer the same educational opportunities 
for students that larger districts do. 


The Myths and the Reality 

The mythology surrounding small school districts in British Columbia is very well 
established, having survived a variety of Royal Commission enquiries and numer- 
ous other investigations and reports. It parallels exactly a similar mythology in the 
United States, which has led to “the most successfully implemented educational 
policy of the past fifty years” in that country, the consolidation of small schools 
and small districts (Sher & Tompkins, 1977, p. 43). The three main myths about 
small districts are: 

Small school districts are inefficient—their costs per pupil are necessarily higher than 

those of larger districts; 

Small school districts are ineffective—they provide lower quality education than larger 

districts; and 

Small school districts are inequitable—they do not provide the same levels of educational 

opportunity to students as larger districts. 
Detailed discussion of these myths will inevitably require the introduction of an as- 
sociated issue: that of school size. 
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Myth 1: Inefficiency 

The costs of providing educational services are largely the costs of providing 
trained teachers. For example, in 1982 teacher salaries made up 76% of district 
operating costs in British Columbia. Teacher costs have two parts: salary levels and 
intensity (the ratio between students and teachers). In British Columbia, there is 
no relationship between district size and teacher salary costs per pupil. (For infor- 
mation on the sources of data, and technical information on analyses, see Coleman 
& LaRocque, 1984. For all correlations reported here, p<.001.) In effect, this 
means that salary schedules do not vary with district size. However, there is a dis- 
tinct relationship between school size and teacher salary costs per pupil. The 
Pearson correlation coefficient is .45. Similarly, there is a strong relationship be- 
tween school size and pupil/teacher ratio (PTR): the coefficient is .65. These rela- 
tionships, which are obviously interactive, have remained stable over several years. 


As a consequence of these relationships between school size and teacher costs, 
there is a strong association between school size and overall operating costs: For 
1981 the coefficient was —.58; for 1984 it was —.57. Similarly, there is a strong and 
stable relationship between costs and PTR. Most importantly, there is an even 
stronger relationship between PTR and school size: .70 in 1981, and .65 in 1984. 
When they are considered together, these two variables, school size and PTR, 
accounted for 84% of the variance in operating costs between school districts in 
1982. The impact of these variables considered jointly is much more important 
than either alone, and has been strengthening. 


Small schools are very expensive to operate, because they require much lower 
PTRs than larger schools. This view is consistent with the British Columbia data 
and with the experience of educators; additionally, it seems to hold true in a 
variety of jurisdictions (Sher & Tompkins, 1977, p. 50). It is crucial to note that it 
is in conjunction with small schools that low PTRs become a major predictor of 
costs for school districts. Small schools are cost-inefficient. Because small schools 
occur disproportionately in small districts (the correlation between district size and 
school size for 1981 was .63), this school level inefficiency is at least a, and perhaps 
the, major cause of high costs in small districts, and is an extremely important ele- 
ment in any discussion of district efficiency. 


This set of relationships for British Columbia is represented in Figure 1. In this 
representation the bracketed numbers show the strength of the association for 
1984, the arrows show a posited causal connection and direction, and the double 
lines show an association which is not causal, but a consequence of other relation- 
ships. 

Because the small districts of the province tend to contain the small schools, it 
seems at least possible that the prevailing opinion that small districts are ineffi- 
cient is based on a confusion between district level and school level problems. 
When the effects of school size are statistically eliminated from the relationship be- 
tween district size and per pupil costs, the resulting correlation coefficient is .05 for 
1981 and .02 for 1984. These relationships are statistically and practically meaning- 
less. 


The simple conclusion is obvious: Small districts are exactly as efficient as 
larger ones. Small schools, however, are inefficient. To say that small districts are 
as fiscally efficient as larger ones may conceal the fact that in order to achieve this 
level, given the penalty of small schools, small districts have had to divert spending 
from services and activities common in larger districts. This spending in small 
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(.65) eH) 
SCHOOL SIZE——_——— > PTR—— —— > OPERATING COSTS 
(.45) 
(.63) DISTRICT SIZE C= .37) 


Figure 1: The relationship between district size, school size, PTR, and operating costs in Bri- 
tish Columbia in 1984. 


districts supports the inevitably lower PTRs required by the small schools. In the 
graphic representation of relationships above, the correlation between district size 
and PTR is .45: low PTRs are found disproportionately in small districts. The 
important policy issue here, then, is how best to deal with the financial problems 
created in school districts by small and fiscally inefficient schools. 


Thus, small districts are at present providing lower levels of services of various 
kinds, probably in order to maintain the fiscally inefficient small schools. This 
issue will be discussed subsequently, in relation to the equity issue. 


Myth 2: Ineffectiveness 

One of the strongest and most important relationships in education is that between 
family background and student achievement levels. Since the Coleman et al. Re- 
port on Equality of Educational Opportunity (1966) we have known that family 
background, and particularly family educational level, is the best single predictor of 
student achievement, far more important than any other single factor. 


For British Columbia, the best data available suggest that between 12% and 
40% of the variance in test scores (depending on subject area and grade) are 
accounted for by family background variables (Coleman, in press). Furthermore, 
within the province some districts have very much higher aggregate levels of family 
education than others. In particular, there is a strong and stable relationship be- 
tween district size and family education level (1981: r=.43; 1984: r=.41; Coleman, 
in press). Thus interdistrict comparisons which do not take family education levels 
into account are invariably biased against small districts. 

When the effects of family education levels are controlled statistically, the cor- 
relations between district size and student achievement are .05 for both 1981 and 
1984, and, as before, are statistically meaningless. Similarly, the correlations be- 
tween school size and student achievement are .21 and .08, and not statistically sig- 
nificant at the .01 level (Coleman, in press). 

Evidence of a different kind is also available. A case study by Youngberg (1986) 
examined the relationships between achievement levels in basic education courses 
for two cohorts of students. The first group consisted of 96 students who completed 
the junior secondary grades in a group of rural junior secondary schools in a north- 
ern school district. The second consisted of a group of 96 students who attended 
the junior secondary grades in the urban schools of the same district during the 
same period. A wide variety of data were collected and analyzed, including student 
surveys, records of achievement, and socioeconomic data on family backgrounds. 
The rural students had noticeably lower family education levels and educational 
aspirations; for example 52.6% of the students from rural junior secondary schools 
completed Grade 12 graduation requirements, versus 73.9% for the urban students. 
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However, the rural students received significantly better grades in senior secondary 
school subjects: a mean GPA of 3.76 versus 3.39 (a statistically significant differ- 
ence; p<.01). Further, the students were quite satisfied with their experiences in 
the rural schools. They perceived the preparation they had received for senior sec- 
ondary school work to be equivalent to that of the urban students, and 70.7% of 
them believed that rural junior secondary programs should be maintained. 


This case study provides support and some detailed explanation of the results 
of the province-wide statistical analysis. Small and rural junior secondary schools 
can provide perfectly satisfactory educational programs. To return to the provincial 
level, when the effects of family educational level are taken into account, small 
school districts are precisely as effective as large districts, in terms of student 
achievement. 


Myth 3: Inequity 

The notion that small districts do not provide the same levels of educational oppor- 
tunity as larger ones is the most difficult of the myths to deal with, precisely be- 
cause it is the least clearly articulated. For example, if educational opportunity is 
defined very broadly as cultural, social, and recreational possibilities, then it seems 
indisputable that small communities, in which small districts tend to be located, do 
offer a narrower set of possibilities. If the debate is focused on schooling, then edu- 
cational opportunity can be defined as access to a good quality of basic educational 
programs which provide preparation for postsecondary or career possibilities. The 
fact that student achievement scores in basic program areas, when aggregated to 
district levels, are not associated with district size, seems to suggest that at least 
one form of educational opportunity is distributed across the province without ref- 
erence to the school district of residence. 


From a parental perspective this situation is probably moderately satisfactory. 
Again, case study data are available. Wicharuk (1986) studied current parental per- 
ceptions of a school board ‘decision taken in 1979 to decentralize junior secondary 
programs from an urban center to a group of rural elementary schools. A series of 
open-ended interviews provided the data. A very considerable change of opinion 
amongst parents had taken place. With one exception, this change of opinion was 
favorable to the rural secondary option. More interesting were the reasons given for 
the positive view of the rural option. A typical comment was “C. (school) teaches 
students how to work and gives them better academic preparation for senior high 
school than do the urban schools.” With regard to electives: “it would be nice to 
have more choice in electives, but it’s more important that the students get the 
basics that they need for high school.” The most significant factor emerging to ac- 
count for the rather high level of parent satisfaction, according to Wicharuk, was 
the provision of a good basic program with high academic standards. 

With respect to academic opportunities at the school there was almost unanimous ap- 
proval and support. ... For many who had originally opposed the secondary division, it 
was the factor which reversed the earlier opinion. Indeed, one of the most frequent com- 
ments made by parents was “Look at the honor roll in town. They must be doing a good 
job here for so many of our kids to do so well there.” (p. 64) 
Commonly, both district and school consolidation has been strongly resisted by 
parents, unconvinced by the advantages of larger units. The reverse, at least in one 
British Columbia school district, is not the case. 
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Another definition of equity focuses on service level equity. In discussions of 
funding in the 1970s this became the central issue (Coleman, 1972a). It has already 
been noted that small districts in British Columbia are probably providing lower 
levels of services of various kinds, in order to maintain the fiscally inefficient small 
schools. (Such lower levels are apparently common; see, for example, Dunne, 1977). 
An illustration of this problem is provided by Baldwin’s (1986) description of 
teacher assignments in a small rural elementary/secondary school: 

Out of the six FTE teachers, 1.4 were utilized at the junior secondary level. The following 
shows the teaching assignments of the two teachers involved: 

Teacher A - English 8, English 9, English 10, Math 8, Math 9, Math 10, Science 8, Science 
9, Science 10, Phys. Ed. 8, Phys. Ed. 9, Phys. Ed. 10, Foods and Nutrition 9, French 8, 
French 9, French 10. 

Teacher B - (Principal) - Social 8, Social 9, Social 10, Typing 9/10, Drafting 8. 

Although Teacher A’s load includes 14 courses, this individual did not in actuality have 
that many courses to prepare for and teach. As will be seen later, some of these courses 
were collapsed by subject area in order to make the assignment more manageable. Teach- 
er B, the Principal, also taught some elementary level programs. (Baldwin, 1986, p. 112) 


Such teacher assignments are rare in large urban districts. 


With regard to equity, the situation seems to be as follows. In terms of basic 
educational programs, there is a reasonable level of equity; from the perspective of 
parents, it is probable that this situation, to the extent that it is known, is or would 
be considered satisfactory. With regard to service levels, there has existed a sub- 
stantial degree of inequity, probably associated with the fiscal difficulty districts 
experience in operating small schools. (Very recent data suggest that this problem 
has been recognized by provincial authorities; this will be the subject of a follow-up 
report.) Amalgamating small districts into larger ones does not resolve these fiscal 
difficulties, it only makes them less apparent to all except the district and school 
administrators who must cope with them. 


Policy and Research Implications 

Amalgamation as a Policy Option 

The traditional solution to the problem of small districts has been district consoli- 
dation or amalgamation. In essence, there are three difficulties with district amal- 
gamations as a policy option. The first has to do with the rationale. As has already 
been suggested, in British Columbia small districts are as efficient and effective as 
larger ones. Thus no increase in either efficiency or effectiveness could reasonably 
be expected to follow from amalgamations. 

The second has to do with practicality from a geographical perspective: Just as 
small schools in certain geographical locations are inevitable, so too are small 
school districts. To use one of the more obvious examples, if the school district of 
Central Coast did not exist at present, it would be necessary to invent it, or some- 
thing very like it. Similarly, if the small school at Rivers Inlet did not exist, it 
would have to be created. 

These assertions rest on a very important assumption: So long as the govern- 
ment of the province remains committed to making public school education avail- 
able to every child of school age in the province, and to local control of the delivery 
of educational services, small schools and small districts must exist given the cur- 
rent population distributions in small centers in British Columbia. 
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The third difficulty concerns governance: Small school districts have reputedly 
always been able to maintain a close and mutually satisfactory relationship with 
the communities they serve. As has already been pointed out, small communities 
tend to value a positive educational climate more highly than other kinds of educa- 
tional benefits such as access to a wide range of electives or to extensive specialized 
services. Perhaps the best evidence of the general satisfaction of small communities 
with their small rural district is the opposition usually expressed to amalgamation 
proposals. In amalgamating districts out of existence, there is a strong possibility 
that community pride and autonomy will be badly damaged, with both educational 
and political repercussions. From a purely educational perspective, local commu- 
nity support of particular kinds is demonstrably associated with high levels of 
student achievement, at least in other jurisdictions. (Fullan, 1982, provides a review 
of this line of research.) 


It is possible, even probable, that negative consequences of both political and 
educational types would follow from further school district amalgamations. Given 
the demonstrated relationship between district size and district efficiency, effec- 
tiveness, and equity, it is hard to imagine why such amalgamations might be con- 
sidered. However, if small districts are to continue to exist, some attention must be 
paid to the problem of small schools. 


The Problem of Small Schools 

The relative fiscal inefficiency of small schools in British Columbia has already 
been discussed. Given the geographical factors which often determine the need for 
the small schools, a major increase in the mean school size in the province is not a 
realistic expectation. No doubt the clear realization that the inefficiency of small 
schools is closely associated with the PTR may make some better administrative 
decisions possible, but small schools will continue to be inefficient. 


However, a consideration of the effects of small schools on their clients might 
help balance this picture of relative inefficiency. The view that small schools are in- 
effective, as well as inefficient, rests largely on analyses of secondary schools. Sher 
and Tompkins (1977) have shown that these have been misinterpreted, and that 
statistical data do not support the view that small secondary schools are less effec- 
tive than large ones in the United States. Similarly, Rosenfeld (1977), in a case 
study of centralization in Vermont, questions the supposed better preparation of 
students in larger schools. He also demonstrates that occupational aspirations tend 
to be higher amongst students from small high schools, despite the lower levels of 
socioeconomic status of the students. Husen (1985), long associated with the IEA 
international and longitudinal comparisons of student achievement, has recently 
argued that “we must take steps to make schools smaller” (pp. 354). 


For British Columbia, small schools seem to be as effective as larger ones. 
Youngberg’s (1986) careful comparisons of school achievement for two cohorts of 
students certainly do not support the general view that rural secondary schools are 
ineffective. Wicharuk’s (1986) analysis of parent perceptions suggests that rural 
parents are making careful judgments about school quality on the same basis as 
urban parents, that is, the quality of the basic education program of the school. On 
this basis, the parents she interviewed were generally satisfied with their small 
rural schools. Baldwin (1986) suggests that the problems of small schools are large- 
ly resource problems. 
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At the provincial level, small schools occur mostly in small districts, and when 
family background factors influencing achievement are controlled, small districts 
and small schools are as effective as large ones, as has already been pointed out. 
There is no reason to believe, on the basis of either the district level case studies 
reported here or the province-wide analysis, that small schools are less effective 
educationally than large ones with respect to the basic education programs impor- 
tant to parents. 


It is almost certainly the case that small schools operate in different ways from 
large ones (Barker & Gump, 1964); it seems as likely that the advantages of small 
schools—personalization, community support—offset the disadvantages of limited 
supplies and equipment and restricted teacher specialization. Given some 
reconsideration of the resource allocation system, the small rural schools might in 
fact be demonstrably superior educational environments, worth emulating in urban 
settings. 


The available evidence suggests, then, that small schools are educationally ef- 
fective, though fiscally inefficient; that there are often compelling community and 
geographical reasons for maintaining them; and that they occur in small districts in 
British Columbia and constitute a serious financial burden for these districts. The 
provincial commitment to equitable educational services, delivered locally, entails 
an obligation to reexamine the funding formula which presently limits the avail- 
ability in small districts of the resources necessary to operate small schools. 


Conclusions 

The general conclusion of this analysis is that small districts in British Columbia 
are as efficient, effective, and equitable as larger ones. They suffer from serious re- 
source problems associated with the inevitably higher costs of operating small and 
remote schools. Currently these costs are not acknowledged in the provincial 
funding system for school districts. The strengths of small districts and small 
schools are not those of larger systems. Yet they do serve their communities well, in 
ways judged important by these communities. Given equitable access to provincial 
resources, these small units may indeed become models of educational quality. In 
public educational organizations in British Columbia, small is at present as beauti- 
ful as large; in future it may become clear that it is indeed more beautiful than 
large. 

Applying data from a province level case study to the wider educational scene is 
difficult. However, the data from British Columbia suggest that the apparent 
inefficiencies of small school districts are due not to their size, but to the costs they 
incur from operating small schools. It is not district size, but mean school size, that 
is the problem. As long as the confusion between district level and school-level fac- 
tors exists, the appropriate steps to alleviate the fiscal difficulties of the small 
school districts will not be taken. British Columbia is not the only province in 
which amalgamation is seen as the solution to the problem of the small school dis- 
trict, and it is likely that the confusion between district level and school level fac- 
tors exists elsewhere. The nature of the fiscal difficulties of small school districts in 
other jurisdictions must be clarified, perhaps by replicating the analyses described 
here. Then attention can be focused on developing other ways of funding school 
districts so that those districts with small schools are treated equitably. 


A more speculative suggestion, based on long-standing concerns | (Coleman, 
1972b) about the efficiency and effectiveness of education in large districts, is that 
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the whole issue of the size of administrative units requires reexamination using the 
new qualitative methodologies. We suspect that there is little positive justification 
for units which include many thousands of students, hundreds of teachers, and 
dozens of schools, and much that is negative about them. 
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BOOK REVIEWS 


THE LEADERSHIP CLASS IN SCOTTISH EDUCATION. By Walter 
M. Humes. Edinburgh: John Donald Publishers, 1986, 228 pp. 


As the first major book on Scottish education for well over a decade, this study is 
deserving of much attention. It is not, however, a conventional study with chapters 
on the various levels of schooling and a historical introduction to illuminate the 
contemporary situation. There is little here about what actually goes on in schools. 
Instead the book is a sustained attack upon the administration and control of Scot- 
tish education. The bureaucracy is indicted not for corruption, but for authoritar- 
ianism, complacency, intolerance to criticism, and resistance to democratic change. 
The book is not entirely easy to read; it is spattered with acronyms and 
abbreviations so that the reader would be quite out of his depth without the key of 
61 items which is provided at the beginning. Yet the book is well written and the 
case is argued with formidable skill. 


The author begins by debunking the “received wisdom” according to which 
Scottish education proceeds by cooperation between central and local government 
with “a multiplicity of bodies and groups” involved in the educational partnership. 
Teachers are held to enjoy much greater freedom than their colleagues south of the 
border and, in general, Scottish education is seen as more democratic and egalitar- 
ian than the English system. It is also regarded as an integral part of a unique 
Scottish culture and an effective instrument for resisting assimilation by the Eng- 
lish. In the author’s opinion, decentralization and democracy are a myth in Scottish 
education and sentimental nationalism serves only to induce complacency and pre- 
vent a serious examination of underlying theoretical issues. 


The author then proceeds to develop his thesis that a relatively small group of 
people, “who all know each other,” fix the national agenda and that Scottish educa- 
tion must be seen within the context of this fact. As far as education is concerned, 
the leadership class comprises senior civil servants in the Scottish Education De- 
partment (SED), members of the inspectorate, local Directors of Education, and 
the leading officials in various educational bodies. It would be wrong, cautions Dr. 
Humes, to suggest that bureaucrats are all powerful and completely isolated from 
outside pressures, but increasingly, in all the western countries, the politicians are 
finding it difficult to constrain the officialdom which ostensibly they control. 

The various sectors of the leadership class are examined in considerable detail 
with the SED receiving particular attention. In practice, SED officials have great 
freedom of action and they proceed (as bureaucrats usually do) not by any 
consistent principle but rather by reacting to pressures from interested parties both 
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within education and without. Reforms may indeed come, as they have in recent 
years, but inevitably the most successful pressure groups are those which know the 
system rather than those with the best case. Also, the judgment of officials is inev- 
itably influenced by their own privileged social origin and the fact that they consti- 
tute a powerful interest group themselves. The author is not impressed by SED 
performance, and he thinks the situation is made worse by the poor quality of re- 
gional councilors in Scotland. Chosen for their party loyalty rather than their abil- 
ity, they are quite unable to defend local interests against the centralizing 
tendencies of Scottish government. Moreover, both they and the Directors of Edu- 
cation have a conservative interest in that greater grassroots participation in 
decision-making would diminish their power. 


The Scottish inspectorate is also subjected to searching and unflattering en- 
quiry. According to the “received wisdom,” the inspectorate has done more than 
any other single body to exercise a beneficial and creative influence on Scottish 
education. Undoubtedly, the 69 inspectors are a powerful force, for they can influ- 
ence the promotion of classroom teachers, and, in addition to their reports and 
other publications, they serve between them on nearly 400 bodies of one sort or an- 
other. In the author’s view, however, the inspectors, although often aggressive 
towards teachers, are unduly acquiescent toward government policy because they 
do not wish to jeopardize their chances of promotion. Their reports are “relentlessly 
bland” and their publications on specific topics are of limited use because they are 
often evaluating policies which they themselves helped to introduce. The inspectors 
constitute a very prestigious segment of the leadership class and are, therefore, not 
agents of progressive change. 

Another highly visible body in Scottish education is the Consultative Commit- 
tee on the Curriculum (CCC) which was established with high hopes in 1965 and 
reconstituted in 1976. There is a full-time secretariat headed by a member of the 
inspectorate, specially seconded for the purpose, and all appointments to the CCC 
are made by the Secretary of State on the recommendation of various interested 
parties. Our author is predictably skeptical of such a structure, especially as ac- 
ceptable service on the CCC is seen as a stepping stone to other appointments in 
the gift of the Secretary of State. Some critics, he declares, see the Committee as “a 
front organization . . . to provide a cloak of respectability for SED-imposed direc- 
tives.” Certainly no serious disagreements among CCC members are aired in public, 
and the desire for consensus produces “a bland elision of many substantive issues.” 
The CCC is viewed as a broken reed and little good can be expected from it. 

The author’s analysis is perhaps at its deadliest in his discussion of “the false 
god of professionalism” among teachers. Several teachers’ associations exist in Scot- 
land, varying in size, clientele, and militancy. The largest is the Education Institute 
of Scotland (EIS), committed to the advancement both of “sound learning” and of 
the interests of teachers. However in practice these two aims are not always com- 
patible. Rank and file members of all unions are mainly concerned about salaries 
and conditions of work and, therefore, the purely educational policies of the union 
are left to the leaders and the zealots who serve on union committees. These 
people, though usually on the left politically, are easily assimilated by the leader- 
ship class. As to the classroom teachers, their anxiety for promotion to senior 
teaching posts or to administrative appointments constrains them from radical 
action and encourages a false professionalism which is loyal to the status quo. The 
author is also skeptical of what is ostensibly the great progressive innovation of 
Scottish education, namely, the General Teaching Council (GTC) which purports 
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to be a self-regulating professional body like that existing in other professions. The 
Council consists of 49 members and 30 of these are directly elected by teachers. 
There was great public interest when the Council was first established in 1965, but, 
according to Dr. Humes, this interest has declined sharply because of the Council’s 
tendency to react but not to initiate, its reluctance to act against bad teachers, and 
its domination by College of Education principals. Also, the elected members seem 
to owe their allegiance to the political party or union which secured their election 
rather than to the Council or the education service as a whole. This strengthens the 
hand of the SED in dealing with the Council, and altogether the Council provides 
no threat to the power of the establishment. 


The book’s final salvoes are fired at the value and integrity of educational re- 
search. The government pretends to be interested in promoting independent re- 
search but in fact it is only interested in “limited answers to limited questions.” 
There is little encouragement given to researchers who wish to address the 
historical and philosophical questions which might open up radical lines of 
thought. Dr. Humes is critical of the academic community for compromising its in- 
dependence in order to obtain funding and so advance individual careers. 
Worthwhile research, he remarks, “needs to be courageous” and the first require- 
ment is to eschew government funding. 


In drawing his threads together, the author entitles his concluding chapter 
“From Demolition to Reconstruction,” but there is only a very bare indication of 
the “reconstruction” which the author would like to see. He would recommend 
fixed term contracts for senior bureaucrats, reform of the promotion system in 
schools, improved mechanisms for public accountability, and the end of 
monotechnic institutions for training teachers. Hopefully, such changes would “re- 
assert the privacy of the contract between teachers and learners—and offer scope 
of the kind of ability and imagination that are at present discouraged.” But, be- 
yond this, the author resists the temptation to elaborate on possible reforms. In his 
view, it would be premature to do so until the nature of the present evil is under- 
stood by the public. It is this understanding which he seeks to promote in his book. 
Dr. Humes emphasizes that he is not suggesting a deliberate conspiracy on the part 
of the leadership class. On the contrary, they are not fully conscious of the pro- 
cesses at work and “happily succumb” to the conventions to which they have grown 
accustomed. Yet their psychological evasions and the “Pavlovian reinforcement” of 
the patronage system at all levels produce a situation which serves neither the 
pupils nor the community at large. The ills of Scottish education, of course, cannot 
be divorced from those of Scottish society as a whole, but while Humes acknowl- 
edges this, he does not elaborate. He does not attribute the problems in any signifi- 
cant degree to the English connection, nor does he anticipate any great benefit 
from devolution in itself, should this be achieved. 


Some readers will doubtless express some puzzlement that such problems can 
exist in a country like Scotland which is dominated politically for the most part by 
Labour, the self-proclaimed party of the common man. Certainly one must wonder 
whether Scottish educational administration is entirely as it is painted here. One 
must wonder whether Dr. Humes would find anything much to his liking if he 
turned his attention south of the border, or indeed to any country in the western 
world. Is educational democracy really such a sham? Need one necessarily be skep- 
tical of “limited answers to limited questions” in educational research? One must 
wonder, too, whether true democracy and real local participation, if they could be 
achieved in education, would in fact produce all the reforms which Dr. Humes 
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would doubtless hope to see. For Demos is a fairly conservative animal and often 
settles for the system which he knows. It may be that radical reform if such is de- 
sired is more likely to come by decree than by participation. However, this is a 
vigorous and thought-provoking book which breaks new ground, and its relevance 
is certainly not limited to Scotland. It will delight educational radicals and should 
be compulsory reading for administrators the world over. 


D. Ross Pugh 
Department of Educational Foundations 
University of Alberta 


LEARNING AND DEVELOPMENT: A GLOBAL PERSPECTIVE. By Alan 
Thomas and Edward W. Ploman. Toronto, ON: Ontario Institute for Studies in 
Education, 1986, 222 pp. 


Based on the proposition that the term “learning” represents a “more fundamental 
intellectual discipline” than education, building on the recent use of the term 
“global learning” by the United Nations University and recognizing the increased 
interest in learning as a prime factor in national development, the Symposium on 
Global Learning was arranged in the spring of 1985 at the Ontario Institute for 
Studies in Education, Toronto. It was sponsored jointly by that institution, by the 
UN University, and by the Canadian International Development Agency. Forty-two 
individuals from 21 countries were invited to consider the concept of learning as 
distinct from education with the expectation that such an orientation would avoid 
some of the stereotypes, conventions, and clichés associated with traditional educa- 
tion. Such an orientation was also intended to facilitate a broad exploration of 
learning from a wide variety of disciplines, including psychology, anthropology, 
computer technology, neurology, biology, and communications; all of which fields of 
study and activity are concerned with the ways in which knowledge is transmitted, 
behavior is caused, and individual and social development is enhanced. 

To add a note of immediacy and reality to the discussions on learning, the sym- 
posium focused on the role of learning (or education) in international and develop- 
ment activities. Indeed, one of the key concepts to emerge from this meeting was 
the idea of societal learning, that is, learning which is directed at groups, societies, 
nations, and eventually the entire globe with the hope that as a human race we 
may learn behavior, values, and skills more appropriate to the imperatives which, 
though clearly before us, are unfortunately not always well represented in our edu- 
cational policies and planning. 

This publication collects the major presentations of the Global Symposium on 
Learning and offers new percepts and revised concepts in the related fields of 
learning, education, communications, and development. It does not pretend to be a 
definitive statement on societal and global learning and is best seen as an impor- 
tant addition to the growing literature in this field, building particularly on the 
1972 UNESCO report “Learning to Be” (the Faure Report) and its companion 
volume “Education on the Move” (1975) as well as the sixth report to the Club of 
Rome “No Limits to Learning” (1979). These three publications as well as this 
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recent one emphasize the following points in what they hope is a new and 

stimulating look at learning: | 

1. Learning needs to be seen as something which continues before, beyond, and 
around formal education and is something which occurs in many formal and 
nonformal educational environments. 


2. Learning must be seen by the learner as something which is practical, useful, 
and relevant to contemporary concerns and problems. 


3. Learning must utilize and deal with the impact of technology on society and on 
learning itself. 

4. Careful attention must be given to traditional and indigenous cultures if learn- 
ing, particularly in the developmental context, is to take root in the culture for 
which it is intended. 


5. Learning opportunities must be planned and arranged on a lifelong basis so 
that people at all stages of their life will have an opportunity to learn. 


6. Learning programs need to reintroduce into the learning process the long neg- 
lected themes of morality and ethics, which themes were originally the basis of 
all education and are now almost extinct in educational programming. 


7. Greater use must be made of the arts, of mankind’s creative instincts, of the 
universal needs for meaning and purpose, all of which are mediated largely by 
culture. 


Fortunately, there are some practical illustrations of the kind of learning 
toward which we are being urged. There is, for example, an excellent summary of 
the strategies which are part of the Sarvodaya movement in Sri Lanka. Much has 
been written about this remarkable movement but this account is the most coher- 
ent and comprehensive to date. Equally interesting is the description of the ambi- 
tious Venezuelan experience described by Machado who was minister of State for 
the Development of Human Intelligence in that country between 1979 and 1984. 
This experiment is still underway and has been described as the first time “intelli- 
gence and its development were given a political dimension.” Also included is an 
account of learning among the people of the Dinka culture in the Sudan, presented 
as a living illustration of an educational program the objects of which are to im- 
prove human relationships, further cultural orientation and enhance personal dig- 
nity. There is a detailed account of contemporary adult education programs in 
Shanghai with a description about how such large scale programs can be imple- 
mented. 


The introductory and overview chapters are well worth reading and provide a 
good orientation to the use of some new terms and concepts. For example, the term 
“global” as used in this publication and as applied to learning has a twofold mean- 
ing. It refers first, to an inclusive process which embraces all of the elements in- 
volved in integrated cognition and in preparing the learner with life skills and 
learning habits which will permit him to cope with a range of contradictions and 
large quantities of information. Equally important, global also refers to a better un- 
derstanding of the condition of the world as seen in its cultural, racial, political, 
and social diversity. Global, therefore, is seen as a comprehensive and integrated 
learning strategy as well as an approach to understanding the needs of the age in 
which we live. Not only is the process to be global, its content must be global. 

We must learn how to manage global, complex systems while respecting the automony of 
the processes and elements within these systems. We need to develop the ability of ad- 
justing to and coping with new technology, new demographic patterns, new modes of 
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production, new stages of physical consciousness, new images of the world, and ourselves. 
And, whether we wish it or not, these images must include humanity as a planetary phe- 
nomenon, all of us as stewards in this planet. We must learn how to accept planetary obli- 
gations to present as well as to future generations. (p. xxi) 

It is too early for unanimity about the kind of learning appropriate for this age. 
Nor do we expect a unified curriculum to be developed. But perhaps it is reason- 
able to expect a tolerance for different learning approaches, a recognition of differ- 
ent modes of knowing, a willingness to innovate and, most important, a need for 
approaches to learning which are both global in process and in content. This book 
adds much to the discussion and to our understanding. It will be welcomed by any- 
one interested in new approaches to learning, in the role of learning in national and 
international development, in the role of culture and tradition in learning, and in 
the importance of new technology in understanding ourselves and the world around 
us. 


Glen A. Eyford 
Centre for International Education and Development 
University of Alberta 
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Human Science Research Conference 
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Under the theme “Towards Integration in Human Science,” this interdisciplinary 
conference is sponsored by the School of Psychology, the Faculty of Education, and 
the Department of Philosophy of the University of Ottawa. 


For more information, contact: Dr. Bertha Mook: Human Science Research 
Conference; School of Psychology; 10 McDougal Lane; Ottawa, ON; K1N 6N5. 
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